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NOVEL NUCLEIC ACmS AND POLYPEPTIDES 

1. TECHNICAL FIELD 

The present invention provides novel polynucleotides and proteins encoded by such 
polynucleotides, along with uses for these polynucleotides . and proteins, for example in 
therapeutic, diagnostic and research methods. 



2. BACKGROUND 

Technology aimed at the discovery of protein factors (including e.g., cytokines, such as 
lymphokines, interferons, CSFs, chemokines, and interleukins) has matured rapidly over the past 
decade. The now routine hybridization cloning and expression cloning techniques clone novel 
polynucleotides "directly" in the sense that they rely on information directly related to the 
discovered protein (/. e. , partial DNA/amino acid sequence of the protein in the case of 
hybridization cloning; activity of the protein in the case of expression cloning). More recent 
1 5 "indirect" cloning techniques such as signal sequence cloning, which isolates DNA sequences 
based on the presence of a now weU-recognized secretory leader sequence moti^ as well as 
various PCR-based or low stringency hyhridization-based cloning techniques, have advanced the 
state of the art by making available large numbers of DNA/amino acid sequences for proteins 
that are known to have biological activity, for example, by virtue of their secreted nature in the 
20 case of leader sequence cloning, by virtue of their ceU or tissue source in the case of PCR-based 
techniques, or by virtue of structural siipilarity to other genes of known biological acti^^^ 

Identified polynucleotide and polypeptide sequences have numerous applications in, for 
example, diagnostics, forensics, gene mapping; identification of mutations responsible for 
genetic disorders or other traits, to assess biodiversity, and to produce many other types of data 
25 and products dependent on DNA and amino acid sequences. 

3. SUMMARY OF THE INVENTION 

The compositionsof the present invention include novel isolatedpolypeptides,novel 

isolatedpolynucleotidesencodmgsuchpolypeptides,includingrecombinantDNAmolecuIes. 

30 clonedgenesordegeneiatevariantsthereofiespecidlynatmallyoccurringvariantssu 

variants, antisensepolynucleotidemolecules. and antibodies that specifically 

epitopespresent on suchpolypeptides, as weU as hybridomasproducing such antibodies. 

The compositionsofthe present inventionadditionallyincludevectoi^. including expression 

vectors. containinglhepolynudeotidesof the invention, cells geneticaUyengineeredtoconteinsud^ 
polynucleotides and ceUs geneticaUy engineered to express such polynucleotides. 
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The presentinventionrelates to acoUectioDor library of at least one novel nucleic acid 
sequence assembled from expressed sequence tags (ESTs) isolated mainly by sequencing by 
hybridization (SBH), and in some cases, sequences obtained from one or more pubUc databases. 
The inventionrelates also to the proteins encoded by such polynucleotides, along with therapeutic, 
5 diagnosticandreseaichutiUtiesforthesepolynucleotidesandproteins. These nucleic acid 

sequences are designated as SEQ ID NO: 1-1350. The polypeptides sequences are designated SEQ 
ID NO: 1351-2700. The nucleic acids and polypeptides are provided in the Sequence Listing. In 
the nucleic acids provided in the Sequence Listing, A is adenosine; C is cytosine; G is guanine; T is 
thymine; and N is any of the four bases, hi the amino acids provided in the Sequence Listing, * 

1 0 corresponds to the stop codon. 

The nucleic acid sequences of the present invention also include, nucleic acid sequences that 
hybridize to the complement of SEQ ID NO:1-1350 under stringent hybridization conditions; 
nucleic acid sequences which are alleUc variants or species homologues of any of the nucleic add 
sequences recited above, or nucleic acid sequences that encode a peptide comprising a specific 

15 domainomuncationofthe peptides encodedby SEQ ID NO:1-1350. A polynucleotide 

comprising a nucleotide sequence having at least 90% identity to an identifying sequence of SEQ 
IDNO:1-1350 oradegeneratevariantorfragmentthereof The identifying sequence can be 100 
base pairs in length. 

The nucleic acid sequences of tiie present invention also include tiie sequence mfoimation " 
20 from Ihe nucleic acid sequencesof SEQ ID NO:1-1350. Hie sequence infoimation can be a 
segmentof any one of SEQ ID NO: 1-1350 that uniquely identifies or representsthe sequence 
informationofSEQIDNO:l-1350. 

A coUectionas used in this application can be a collcctionof only one polynucleotide. The 
collection of sequence informationor identifying informationof each sequence can be provided on 
25 a nucleic acid array. lnoneembodiment,segmentsofsequenceinfoimationisprovidedona 

nucleic acid array to detect the polynucleotide that contains the segment. The array can be designed 
to detectfuU-matahormismatchtothepolynucleotidethatcontainstiie segment The collection 
can also be provided in a coaq)Uter-readablefonnaL 

This invention also includes the reverse or direct complement of any of Ihe nucleic acid 
3 0 sequences recited above; cloning or expression vectors containing the nucleic acid sequences; and 
host cells or organisms transformed wilh these expression vectors. Nucleic acid sequences (or their 
reverse or direct complements) accoidingto the inventionhave numerous applications in a variety 
of techniquesknown to those skilled in the art of molecular biology, such as use as hybridization 
probes, use as primers for PGR, use inananay.use incomputer-readablemedia,use in sequencing 
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M-length genes, use for chromosome and gene mapping, use in the recombinant pnxluction of 
protein, and use in the generationof anti-senseDNA orRNA, their chemical analogs and the like. 
In a preferred embodiment, the nucleic acid sequences of SEQ E) NO: 1 -1 350 or novel 

segmentsor partsofthenucleicacids of the inventionareused as primers in expressionassays that 
5 arewellknownintheart In a particularlyprefeired embodiment, the nucleic acid sequences of 
SEQroNO:l-1350 or novel segments or parts of the nudeicacidsprovidedheieinare used in 
diagnostics for identifying ejq)ressed genes or, as wcU ImoAm in the art and exempMed by 

etal..Sdence258:52.59(1992),asexpressedsequencetagsforphysicalmappingofthehu^ 
gaiome. 

10 The isolated polynudeotidesof flie inventioninclude, but are not limited to, a 

polynucleotide comprising any one of the nucleotide sequences set forth in SEQ E> NO:1-1350; a 
polynucleotidecomprisinganyofdiefuUlengthproteincodingsequeacesof SEQroNO:! - 1350; 
andapolynudeotidecomprising any of the nucleotide sequences of the matureprotdncoding 
sequencesofSEQIDNO: 1- 1350. Thepolynucleotidesofthepresentinventionalso include, but 
are not limited to. apolynucleotidethathybridizesunderstringcntlQrbridizationcon^ (a) 
the complement 6f any one of the nucleotide sequences set forth in SEQ ID NO:1-1350; (b) a 
nucleotide sequence encoding any one of the amino add sequences set foitii in the Sequence Listing 
(e.g., SEQ ID NO: 1351-2700);(c)apolynucleotidewhidi is an allelic variant of any 
polynucleotidesrecited above;(d)apolynucleotide which encodesaspecieshomolog(e.^. 
orthologs)ofanyoflheproteinsTecited above; or(e)apolynucleotidethatencodesapolype^ 
comprising a specific domain or tnmcationof any of the polypeptides comprising an amino add 
sequence set forth in the Sequence Listing. 

The isolated polypeptides of the invention include, but are not limited to, a polypeptide 
comprising any of the amino acid sequences set forth in the Sequence Listing; orthe corresponding 
25 full length or mature protein. Polypeptides of the invention also include polypeptides with 

biological activity that are encoded by (a) any of thepolynucleotideshavinganucleotidesequence 
set forth in SEQ ID NO:1-1350; or (b) polynucleotides that hybridize to the complement of fte 
polynudeotidesof(a)understringenthybridizationconditions.BiologicallyorimmunologicaUy 
active variantsof any of the polypeptide sequences in the SequenceListing, and "substantial 
equivalents"thereof (e.g., with at least about 65%, 70%, 75%, 80%, 85%, 90%. 95%, 98% or 99% 
amino add sequence identity) that preferably retain biological activity are also contemplated. The 

polypeptidesof the inventionmay be whoUy or partially chemically synthesizedbut are preferably 
produced by recombinantmeans using the genetically engineered cdls (e.g. host cells) of the 
invention. 



20 



30 



3 



wo 01/57188 



PCT/USOl/03800 



The invention also provides compositions comprising a polypeptide of the invention. 
Polypeptide compositions of the invention may further comprise an acceptable carrier, such as a 
hydrophilic, e.g., pharmaceutically acceptable, carrier. 

The invention also provides host cells transformed or transfected with a polynucleotide of 
5 the invention. 

The invention also relates to methods for producing a polypeptide of the invention 
comprising growing a culture of the host cells of the invention in a suitable culture medium 
under conditions permitting expression of the desired polypeptide, and purifying the polypeptide 
from the culture or from the host cells. Preferred embodiments include those in which the 
1 0 protein produced by such process is a mature form of the protein. 

Polynucleotides according to the invention have numerous applications in a variety of 
techniques known to those skilled in tiie art of molecular biology. These techniques include use 
as hybridization probes, use as oUgomers, or primers, for PCR, use for chromosome and gene 
mapping, use in the recombinant production of protein, and use in generation of anti-sense DNA 
15 or RNA, their chemical analogs and the like. For example, when the expression of an mKNA is 
largely restricted to a particular cell or tissue type, polynucleotides of tiie invention can be used 
as hybridization probes to detect the presence of the particular cell or tissue mRNA in a sample 
usmg, e.g., in situ hybridization. 

In other exemplary embodunents, the polynucleotides are used in diagnostics as 
expressed sequence tags for identifying expressed genes or, as well known m the art and 
exempUfied by Volkath et al., Science 258:52-59 (1992), as expressed sequence tags.for 
physical mapping of the human genome. 

The polypeptides according to tiie invention can be used in a variety of conventional 
procedures and methods that are currentiy applied to oflier proteins. For example, a polypeptide 
25 of the invention can be used to generate an antibody that specifically binds tiie polypeptide. Such 
antibodies, particularly monoclonal antibodies, are usefW for detecting or quantitating the 
polypeptide in tissue. The polypeptides of the invention can also be used as molecular weight 
markers, and as a food supplement. 

Methods are also provided for preventing, treating, or ameliorating a medical condition 
30 which comprises the step of administering to a mammatian subject a tiierapeutically effective 
amount of a composition comprising a polypeptide of tiie present invention and a 
pharmaceutically acceptable carrier. 

In particular, tiie polypeptides and polynucleotides of tiie invention can be utilized, for 
example, in metiiods for flie prevention and/or treatment of disoiders mvolving aberrant protein 
3 5 expression or biological activity. 
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The present invention further relates to methods for detecting the presence of the 
polynucleotides or polypeptides of the invention in a sample. Such methods can, for example, I 
utilized as part of prognostic and diagnostic evaluation of disorders as recited herein and for the 
identification of subjects esdiibiting a predisposition to such conditions. The invention provides 
5 a method for detecting the polynucleotides of the invention in a sample, comprising contacting 
the sample with a compound that binds to and forms a complex with the polynucleotide of 
int^est for a period sufBcient to form the complex and under conditions suflScient to form a 
complex and detecting the complex such that if a complex is detected, the polynucleotide of 
interest is detected. The invention also provides a method for detecting the polypeptides of the 
10 invention in a sample comprising contacting the sample with a compound that bmds to and form 
a complex with the polypeptide under conditions and for a period suflBcient to form the complex 
and detecting the formation of the complex such that if a complex is formed, the polypeptide is 
detected. 

The invention also provides kits comprising polynucleotide probes and/or monoclonal 
15 antibodies, and optionally quantitative standards, for carrying out methods of the invention, 
furthermore, the invention provides methods for evaluating the efficacy of drugs, and 
monitoring the progress of patients, involved in clinical trials for tiie treatment of disorders as 
recited above. 

The invention also provides methods for the identijScation of compounds that modulate 

20 increase or decrease) the expression or activity of the polynucleotides and/or polypeptides 

of the invention. Such methods can be utilized, for example, for the identification of compound 
that can ameliorate symptoms of disorders as recited herein. Such methods can include, but are 
not limited to, assays for identifying compounds and oflier substances that interact with (e,g,, 
bind to) the polypeptides of the invention. The invention provides a method for identifying a 

25 compound that binds to the polypeptides of the invention comprising contacting the compound 
with a polypeptide of the invention in a cell for a time sufBcient to form a 
polypeptide/compound complex, wherein the complex drives expression of a reporter gene 
sequence in the cell; and detecting, the complex by detecting the reporter gene sequence 
expression such that if expression of the reporter gene is detected the compound the binds to a 

30 polypeptide of the invention is identiiied. 

The methods of the invention also provides methods for treatment which involve flie 
administration of the polynucleotides or polypeptides of the invention to individuals exhibiting 
symptoms or tendencies. In addition, the invention encompasses methods for treating diseases o 
disorders as recited herein comprising administering compounds and other substances that 

35 modulate the overall activity of the target gene products. Compounds and other substances can 
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effect such modwlation either on the level of target gene/protein expression or target protein 
activity. 

The polypeptides of the present invention and the polynucleotides encoding them are also 
useful for the same functions kno\wnto one of skill in the art as the polypeptides and 
polynucleotides to which they have homology (set forth in Table 2). If no homology is set forth 
for a sequence, then the polypeptides and polynucleotides of the present invention are usefiil for 
a variety of applications, as described herein, including use in arrays for detection. 



10 4. DETAILED DESCRIFnON OF THE INVENTION 

4.1 DEFINITIONS 

It must be noted that as used herein and in the appended claims, the singular forms "a", 
"an" and "the" include plural references unless the context clearly dictates otherwise. 

15 The term "active" refers to those forms of the polypeptide which retain the biologic 

and/or immunologic activities of any naturally occurring polypeptide. According to the 
invention, the terms "biologicaUy active" or "biological activity" refer to a piotem or peptide 
having structural, regulatory or biochemical fimctions of a naturally occurring molecule. 
Likewise "immunologically active" or "unmunological activity" refers to the capability of the 

20 natural, recombinant or synttietic polypeptide to induce a specific immune response in 
appropriate animals or cells and to bind with specific antibodies. 

Hie tenn "activated ceUs" as used m this appUcation are those ceUs which are engaged in 
extracellular or intraceUuIar membrane trafficking, including the export of secretory or 
enzymatic molecules as part of a normal or disease process. 

25 The terms "complementary" or "complementarity" refer to tiie natural bindmg of 

polynucleotides by base pairing. For example, the sequence 5'-AGT-3' binds to the 
complementary sequence 3'-TCA-5'. Complementarity between two single-stranded molecules 
may be "partial" such that only some of tiie nucleic adds bind or it may be "complete" such tiiat 
total complementarity exists between the single stranded molecules. The degree of 

30 complementarity between the nucleic add strands has significant effects on the efficiency and 
strength of the hybridization between die nudeic acid strands. 

The term "embryonic stem cells (ES)" refers to a cell tiiat can give rise to many 
difeientiated cell types in an embryo or an adult, including tiie germ cells. The term "gemi line 
stem cells (GSCs)" refers to stem ceUs derived from primordial stem cells that provide a steady 

35 and continuous soureeofgemiceUs for the production of gametes. TTie term "primordial gerai 
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ceUs (PGCs)" refers to a small populatioa of ceUs set aside from other cell lineages particularly • 
, from the yolk sac, mesenteries, or gonadal ridges during embryogenesis that have the potential to 
differentiate into genu cells and other cells. PGCs are the soun« from which GSCs and ES ceUs 
are derived The PGCs, the GSCs and the ES cells are capable of self-renewal. Thus these cells 
5 not only populate the geim line and give rise to a plurality of terminaUy differentiated ceUs that 
comprise the adult specialized organs, but are able to regenerate themselves. 

The term "expression modulating fragment," EMF, means a series of nucleotides which 
modulates the expression of an operably Imked ORP or another EMF. 

As used herein, a sequence is said to "modulate the expression of an operably linked 
10 sequence" when the expression of the sequence is altered by the presence of the EMF. EMFs 
include, but are not Umited to, promoters, and promoter modulating sequences (inducible 
elements). One class of EMFs are nucleic acid fragments which induce the expression of an 
operably linked ORF in response to a specific regulatory factor or physiological event. 

-me terms "nucleotide sequence" or- "nucleic acid" or "polynucleotide" or 

15 "oligonculeotide" are used interchangeably and refer to a heteropolymer of nucleotides or the 
sequence of these nucleotides. These phrases also refer to DNA or RNA of genomic or synthetic 
origin which may be single-stranded or double-stranded and may represent the sense or the 
antisense strand, to peptide nucleic acid (PNA) or to ^y DNA-like or RNA-like material. In the 
sequences herein A is adenine, C is cytosine, T is thymine, G is guanine and N is A. C, G or T 

20 (U). It is contemplated that where the polynucleotide is RNA. the T (thymine) in the sequences 
provided herein is substituted with U (uracU). Generally, nucleic acid segments provided by this 

■ invention may be assembled from fragments of the genome and short oHgonucleotide linkers, or 
from a series of oligonucleotides, or from individual nucleotides, to provide a synthetic nucleic 
acid which is capable of being expressed in a recombinant franscriptional unit comprising 

25 regulatory elements derived from a microbial or viral operon. or a eukaryotic gene. 

TTie terms "oligonucleotide fragment" or a "polynucleotide fragment", "portion," or 
"segment" or "probe" or "primer" are used interchangeably and refer to a sequence of nucleotide 
residues which are at least about 5 nucleotides, more preferably at least about 7 nucleotides, 
more preferably at least about 9 nucleotides, more preferably at least about 11 nucleotides and 

30 most preferably at least about 17 nucleotides. Hie fragment is preferably less thah about 500 
nucleotides, preferably less than about 200 nucleotides, more preferably less than about 1 00 
nucleotides, more preferably less than about 50 nucleotides and most preferably less than 30 
nucleotides. Preferably the probe is fiom about 6 nucleotides to about 200 nucleotides. 

preferably from about 15 to about 50 nucleotides, more preferably from about 17 to 30 
35 nucleotidesandmostpreferablyfromabout20to25nucleotides. Preferably the fragments can 
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be used in polymerase chain reaction (PGR), various hybridization procedures pr microanay 
procedures to identify or amplify identical or related parts of mRNA or DNA molecules. A 
fiagment or segment may uniquely identify each polynucleotide sequence of the present 
invention. Preferably the fragment comprises a sequence substantially similar to any one of SEQ 
5 IDNOs:l-1350. 

Probes may, for example, be used to determine whether specific mRNA molecules are 
present in a ceU or tissue or to isolate similar nucleic acid sequences from chromosomal DNA as 
described by Walsh et al. (Walsh, P.S. et al., 1992, PGR Methods Appl 1 :241-250). They may 
be labeled by nick translation. Klenow fill-in reaction, PGR, or other methods well known in the 
10 art. Probes of the present invention, their preparation and/or labeling are elaborated in 
Sambrook, J. et al., 1989, Molecular Gloning: A Laboratory Manual, Cold Spring Harbor 
Laboratory, NY; or Ausubel, F.M. et al.. 1989, Current Protocols in Molecular Biology, John 
WUey & Sons. New YorkNY, both of which are incorporated herein by reference in their 
entirety. 

15 The nucleic acid sequences of the present invention also include tiie sequence 

information from the nucleic acid sequences of SEQ E) NO:1-1350. The sequence information 
can be a segment of any one of SEQ ID NO:1-1350 that uniquely identifies or represents the 
sequence information of that sequence of SEQ ID NO:1-1350. One such segment can be a 
twenty-mer nucleic acid sequence because the probability that a twenty-mer is fiiUy matched in 

20 the human genome is 1 in300. In tiie human genome, there are three biUion base pairs in one set 
of chromosomes. Because 4^° possible twenty-mers exist, there are 300 times more twenty-mers. 
than there are base pairs in a set of human chromosomes. Using the same analysis, the 
probability for a seventeen-mer to be fiilly matched in the human genome is approximately 1 in 
5. When these segments are used in arrays for expression studies, fifteen-mer segments can be 

25 used. The probability that tiie fifteen-mer is fiilly matched in the expressed sequences is also 
approximately one in five because expressed sequences comprise less than approximately 5% of 

the entire genome sequence. 

Similarly, when using sequeuce information for detecting a single mismatch, a segment can 
beatwenty-fivemer. The probability that the twenty-fivemer would appear inahuman genome 
30 withasinglemismatchis calculated by multiplying the probability for a fiill match (1^^) times the 
increasedprobabilityformismatchateachnucleotideposition(3x25). nieprobabiUtythatan 
eighteen mer witii a single mismatch can be detected in an array for expression studies is 
approximately one in five. The probabiHty that a twenty-mer with a single mismatch can be 
detected in a human gmome is approximately one in five. 
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The term "open reading frame," ORF, means a series of nucleotide triplets coding for 
Uno acids without any termination codons and is a sequence translatable into protein. 

Hie terms "operably linked" or "operably associated" refer to fimctionally related nucleic 
acid sequences. For example, a promoter is operably associated or operably linked with a coding 
5 sequence if the promoter controls the transcription of the coding sequence. While operably 
linked nucleic acid sequences can be contiguous and in the same reading frame, certain genetic 
elements e.g. repressor genes are not contiguously linked to the coding sequence but still control 
transcription/translation of the coding sequence. 

The term "pluripotent" refers to tiie capability of a cell to differentiate into a number of 
10 differentiated cell types that are present in an adult organism. A pluripotent cell is restricted in its 
differentiation capability in comparison to a totipotent cell. 

The terms "polypeptide" or "peptide" or "amino acid sequence" refer to an oligopeptide, 
peptide, polypeptide or protein sequence or fragment thereof and to naturally occurring or 
synthetic molecules. A polypeptide "fragment," "portion," or "segment" is a stretch of amino 
15 acid residues of at least about 5 amino acids, preferably at least about 7 amino acids, more 
preferably at least about 9 amino acids and most preferably at least about 17 or more amino 
acids. The peptide preferably is not greater than about 200 amino acids, more preferably less 
. than 1 50 amino acids and most preferably less than 100 amino acids. Preferably the peptide is 
from about 5 to about 200 amino acids. To be active, any polypeptide must have sufficient 
20 length to display biological and/or immunological activity. 

The term "naturally occurring polypeptide" refers to polypeptides produced by cells that, 
have not been genetically engineered and specifically contemplates various polypeptides arising 
from post-translational modifications of the polypeptide including, but not limited to, 
acetylation, carboxylation, glycosylation, phosphorylation, lipidation and acylation. 
25 The term "translated protein coding portion" means a sequence which encodes for the full 

length protein which may include any leader sequence or any processing sequence. 

The term "mature protein coding sequence" means a sequence which encodes a peptide 
or protein without a signal or leader sequence. Hie "mature protein portion" nieans that portion 
of tiie protein which does not include a signal or leader sequence. The peptide may have been 
30 produced by processing in the cell which removes any leader/signal sequence. The mature 
protein portion may or may not include the initial methionine residue. The methionine residue 
may be removed from the protein during processing in the ceU. The peptide may be produced 
synthetically or the protein may have been produced using a polynucleotide only encoding for 
the mature protein coding sequence. 
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can, for example, alter one or more of the biological functions or biochemical characteristics of 
the polypeptides of the invention. For example, such alterations may change polypeptide 
characteristics such as ligand-binding aflEmities, interchain affinities, or degradation/turnover 
rate. Further, such alterations can be selected so as to generate polypeptides that are better suited 
for expression, scale up and the like in the host ceUs chosen for expression. For example, 
cysteine residues can be deleted or substituted with another amino acid residue in order to 
eUminate disulfide bridges. 

The temis "purified" or "substantially purified" as used herein denotes that the indicated 
nucleic acid or polypeptide is present in the substantial absence of other biological 
macromolecules, e.g,, polynucleotides, proteins, and the like. In one embodiment, the 
polynucleotide or polypeptide is purified such that it constitutes at least 95% by weight, more 
preferably at least 99% by weight, of the indicated biological macromolecules present (but water, 
buffers, and other small molecules, especially molecules having a molecular weight of less than 
1 000 daltons, can be present). 

The term "isolated" as used herein refers to a nucleic acid or polypeptide separated fi:om 
at least one other component nucleic acid or polypeptide) present with the nucleic acid or 
polypeptide in its natural source. In one embodiment, the nucleic acid or polypeptide is found in 
the presence of (if anything) only a solvent, buffer, ion, or other component normally present in a 
solution of the same. The terms "isolated" and "purified" do not encompass nucleic acids or 
polypeptides present in their natural source. 

The term "recombinant," when used herein to refer to a polypeptide or protem, means 
that a polypeptide or protein is derived from recombinant (g.g., microbial, insect, or mammalian) 
expression systems. "Microbial" refers to recombinant polypeptides or proteins made in 
bacterial or fungal (e.g., yeast) expression systems. As a product, "recombinant microbial" 
defines a polypeptide or protein essentially free of native endogenous substances and 
unaccompanied by associated native glycosylation. Polypeptides or proteins expressed in most 
bacterial cultures, e.g., E. coli, will be free of glycosylation modifications; polypeptides or 
proteins expressed in yeast will have a glycosylation pattern in general different from those 
expressed in mammalian cells. 

The term "recombinant expression vehicle or vector" refers to a plasmid or phage or virus 
or vector, for expressing a polypeptide from a DNA (RNA) sequence. An expression vehicle can 
comprise a transcriptional unit comprising an assembly of (1) a genetic element or elements 
having a regulatory role in gene expression, for example, promoters or enhancers, (2) a structural 
or coding sequence which is transcribed into mRNA and translated into protein, and (3) 
appropriate transcription initiation and termination sequences. Structural units intended for use 
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in yeast or eukaryotic expression systems preferably include a leader sequence eoabling 
extracellular secretion of translated protein by ahost ceU. Alternatively, where recombinant 
protein is expressed without a leader or transport sequence, it may include an amino terminal 
methionine residue. This residue may or may not be subsequently cleaved fiom the expressed 
5 recombinant protein to provide a final product 

The term "recombinant expression system" means host cells which have stably integrated 
a recombinant transcriptional unit into chromosomal DNA or cany the recombinant 
transcriptional unit extrachromosomally. Recombinant expression systems as defined herein >yill 
express heterologous polypeptides or proteins upon mductionof the regulatory elements linked 
10 to the DNA segment or synthetic gene to be expressed. This.lerm also means host ceUs which 
have stably integrated a recombinant genetic element or elements having a regulatory role in 
gene expression, for example, promoters or enhancers. Recombinant expression systems as 
defined herein will express polypeptides or proteins endogenous to the cell upon induction of the 
regulatory elements linked to the endogenous DNA segment or gene to be expressed The cells 
15 can be prokaryotic or eukaryotic. 

The term "secreted" includes a protem that is transported across or through a membrane, 
including transport as a result of signal sequences in its amino add sequence when it is 
expressed in a suitable host cell. "Secreted" proteins include without limitation proteins secreted 
wholly (eg., soluble proteins) or partiallyCcg.. receptors) from the ceU in which Ihey are 
20 expressed. "Secreted" proteins also include without limitation proteins flial are transported 
across the membrane of the endoplasmic reticulum. "Secreted" proteins are also intended to 
include proteins contaming non-typical signal sequences (e.g. Interleukin-l Beta, see Krasney, 
PA and Young, P.R. (1992) Cytokine 4(2):134 -143) and fectors released fiom damaged ceUs 
(e.g. Interleukin-l Receptor Antagonist.see Arend,W.P. et al. (1998) Annu. Rev. Immunol 

25 16:27-55) 

Where desired, an expression vector may be designed to contain a "signal or leader 
sequence" which will direct the polypeptide through the membrane of a ceU. Such a sequence 
may be naturally present on the polypeptides of the present invention or provided &ota 
heterologous protem sources by recombinant DNA techniques. 

30 The term "stringent" is used to refer to conditions that are commonly understood in the 

art as stringent Stringent conditions can include highly stringent conditions (».e., hybridization, 
to filter-bound DNA m 0.5 M NaHP04, 7% sodium dodecyl sulfate (SDS), 1 mM EDTA at 
65»C, and washing in 0.1X SSC/0.1% SDS at 68°C), and moderately stringent conditions (ie.. 
washing in 0.2X SSaO.1% SDS at Al'Q. Other exemplary hybridization conditions are 

35 described herein in the examples. 
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In instances of hybridization of deoxyoligonucleotides, additional exemplary stringent 
hybridization conditions include washing in 6X SSC/0.05% sodium pyrophosphate at 37*'C (for 
14-base oligonucleotides), 48T (for 17-base oligos), 55T (for 20.base oligonucleotides), and 
60T (for 23-base oligonucleotides). 
5 As used herein, "substantially equivalent" can refer both to nucleotide and amino add 

sequences, for example a mutant sequence, that varies from a reference sequence by one or more 
substitutions, deletions, or additions, the net efifect of which does not result in an adverse 
functional dissunilarity between the reference and subject sequences. TypicaUy, such a 
substantially equivalent sequence varies from one of tiiose listed herem by no more than about 
10 35% (/.e., the number of individual residue substitutions, additions, and/or deletions in a 

substantially equivalent sequence, as compared to the corresponding reference sequence, divided 
by the total number of residues in ttie substantially equivalent sequence is about 0.35 or less). 
Such a sequence is said to have 65% sequence identity to the listed sequence. In one ■ 
embodiment, a substantially equivalent, e.g., mutant, sequence of the invention varies from a 
15 listed sequence by no more than 30% (70% sequence identity); in a variation of this 

embodiment, by no more than 25% (75% sequence identity); and m a further variation of this 
embodiment, by no more than 20% (80% sequence identity) and in a fiirther variation of this 
embodiment, by no more than 1 0% (90% sequence identity) and in a further variation of this 
embodiment, by no more that 5% (95% sequence identity). Substantially equivalent, 
20 mutant, amino acid sequences according to the invention preferably have at least 80% sequence 
identity witii a listed ammo acid sequence, more preferably at least 85% sequence identity, more ' 
preferably at least 90% sequence identity, more preferably at least 95% identity, more preferably 
at least 98% identity, and most preferably at least 99% identity. Substantially equivalent 
nucleotide sequences of the invention can have lower percent sequence identities, taking into 
25 account, for example, the redundancy or degeneracy of the genetic code. Preferably, nucleotide 
sequence has at least about 65% identity, more preferably at least about 75% identity, more 
preferably at least about 80% sequence identity, more preferably at least about 85% sequence 
identity, more preferably at least about 90% sequence identity, and most preferably at least about 
95% identity, more preferably at least about 98% sequence identity, and most preferably at least 
30 about 99% sequence identity. For the purposes of tiie present invention, sequences having 

substantially equivalent biological activity and substantially equivalent expression characteristics 
are considered substantially equivalent. For tiie purposes of determim'ng equivalence, truncation 
of the mature sequence {e.g. , via a mutation which creates a spurious stop codon) should be 
disregarded. Sequence identity may be detennined, e.g. , using tiie Jotun Hein metiiod (Hein, J. 
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(1990) Methods Enzymol. 183:626-645). Identity between sequences can also be detennined by 
other methods known in the art, e.g. by varying hybridization conditions. 

The term "totipotent" refers to the capability of a cefl to differentiate mto all of the ceU 

types of an adult organism. • 
5 Hie tenn "transformation" means introducing DNA into a suitable host ceU so that tiie 

DN A is replicable, either as an extrachromosomal element, or by chromosomal integration. The 

term"transfection" refers to the takingupof an expression vector by a suitable host cell, whether. 

or not any coding sequences ate in fact expressed. Hie term "infection" refers to the mtroduction 

of nucleic acids into a suitable host ceU by use of a virus or viral vector. 
10 As used herein, an "uptake modulatmgfragmenVUMF, means a series of nucleotides 

whichmediatetheuptakeofalinkedDNAfragmentmtbaceU. UMFs can be readily identified 
using known UMFs as atarget sequence or target motif with the computer-based systems 
described below. Tlie presence and activity of a UMF can be confirmed by attaching the 
suspected UMF to a marker sequence. The resulting nucleic acid molecule is then incubated 
15 with an appropriate host under appropriate conditions and the uptake of the marker sequence is 
determmed. As described above, a UMF will mcrease the fi«iuency of uptake of a Imked 
marker sequence. 

Each of the above tenns is meant to encompass all that is described for each, unless the 
. context dictates otherwise. 

20 

42 NUCLEIC ACIDS OF THE INVENTION 

Nucleotide sequences of the invention are set forth in the Sequence Listing. 

Hie isolated polynucleotides of the invention include a polynucleotide comprising the 
nucleotide sequences of SEQ ID NO:1.1350 ; a polynucleotide encoding any one of the peptide 
25 sequences of SEQIDNO:1351-2700; and a polynucleotide comprismg the nucleotide sequence 
encoding the mature protem codmg sequence of the polypeptides of any one of SEQ ID 
NO:1351-2700. Tlie polynucleotides of the present mvention also mclude.but are not limited to. 
a polynucleotide that hybridizes under stringent conditions to (a) the complement of any of the 
nucleotides sequences of SEQ ID NO:1-1350 ; (b) nucleotide sequences encoding any one of the 
30 amino acid sequences set forth in the Sequence Usting; (c) apolymicleotide vdiichis anallcUc 
variant of any polynucleotide recited above; (d) a polynucleotide which encodes a species 
homolog of any of the proteins recited above; or (e) a polynucleotide that encodes apolypeptide 
comprising a specific domain or truncation of the polypeptides of SEQ ID NO: 1351-2700. 
Domains of mterest may depend on tiie nature of the encoded polypeptide; e.g., domains in 
35 iweptor-like polypeptides include Ugand-bindmg, extiaceUular, transmembrane, or cytoplasmic 
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domains, or combinations thereof; domains in immunoglobulin-like proteins include Has variable 
inununoglobulin-like domains; domains in enzyme-like polypeptides include catalytic and 
substrate binding domains; and domains in ligand polypeptides include receptor-binding 
domains. 

5 The polynucleotides of the invention include naturally occurring or wholly or partially 

synthetic DNA, e.g., cDNA and genomic DNA, and RNA, e.g.. mRNA. The polynucleotides 
may include all of the coding region of the cDNA or may represent a portion of the coding 
region of the cDNA. 

The present invention also provides genes coirespondingto the cDNA sequences disclosed 
10 herein. The corresponding genes can be isolated in accordance with imovmrneftods using the 
sequence information disclosed herein. Such mefliods include tiie preparation of probes or primers 
from the disclosed sequence informationfor identificationand/or amplification of genes in 
appropriate genomic libraries or odier sources of genomic materials. Furthssr 5' and 3' sequence can 
be obtained using methods known in the art. For example, fuU length cDNA or genomic DNA that 
15 conespondstoanyofthepoIynudeotidesofSEQIDNOrl-lSSO can be obtained by screening 
appropriate cDNA or genomic DNA libraries under suitable hybridization conditions using any of 
thepolynucleotidesofSEQIDNO:l-l350oraportionthereofasaprpbe. Altemarively.tiw 
polymicleotidesof SEQ ID NO:l-l350may be used as Hie baas for suitable primer(s)that allow 
idenlificationand/or amplification of genes m appropriate genomic DNA or cDNA libraries. 
20 The nucleic add sequences of flie invention can be assembled fitan ESTs and sequences 

(including cDNA and genomic sequ«ices) obtained fi:om one or more public databases, such as 
dbEST, gbpri, and UniGene. The EST sequences can provide identifying sequence infarmatiOT, 
lepresentativefiagmentor segment information, or novel segment informationfor the fuU-length 
gtme, 

25 The polynucleotides of the invention also provide polynucleotides including nucleotide 

sequences that are substantially equivalent to the polynucleotides recited above. Polynucleotides 
according to the invention can have, e.g., at least about 65%, at least about 70%, at least about 
75%, at least about 80%, 81%, 82%, 83%, 84%, more typicaUy at least about 85%, 86%, 87%, 
88%, 89%, more typically at least about 90%, 91 %. 92%, 93%, 94%, and even more typically at 

30 least about 95%, 96%, 97%, 98%, 99%, sequence identity to a polynucleotide recited above. 

Included within the scope of tiie nucleic acid sequences of the invention are nucleic acid 
sequence fragments that hybridize under stringent conditions to any of the nucleotide sequences 
of SEQ ID NO:1-1350, or complem«aits thereof, which fragment is greater than about 5 
nucleotides, preferably 7 nucleotides, more preferably greater than 9 nucleotides and most 

35 preferably greater than 17 nucleotides. Fragments of, e.g. 15, 17, or 20 nucleotides or more that 
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are selective for (i.e. specifically hybridize to any one of the polynucleotides of the invention) 
are contemplated. Probes capable of specifically hybridizing to a polynucleotide can 
differentiate polynucleotide sequences of the mvention from other polynucleotide sequences in 
ihe same family of genes or can differentiate hunuin genes from genes of other .species, and are 
5 preferably based on unique nucleotide sequences. 

The sequences falling within the scope of the present invention are hot limtedto tiiese 
specific sequences,but also include alleUc and species variafionsthereof. AlleUc and species 
variations can be routinely detemiinedby comparingihe sequence provided SEQ ID NO:1-1350, a 
representative&agmentthereof,oranucleotidesequenceat least 90% identical.preferab^^ 

10 identical,to SEQ ID NO:1-1350 with a sequence from another isolate of the same species. 
Furthermore, to accommodate codonvariabiUty.theinveiition includes nucleicacid molecules 
codingforthesameaminoacidsequencesasdolhespedficORFsdisclosedherein. Inotherwords, 
in Ihe codingregionof an ORF.substitutionof one codonfor another codonthatencodesthe same 

amino acid is e}q)ressly contemplated. 
15 Thenearestneighbororhomologyiesultforthenudeicacidsofthepresentinveotion, 

including SEQ ID NO:1-1350, can be obtained by searching a database using an algorithm or a 
program. Preferably, a BLAST which stands for Basic Local Aligmnent Search Tool is used to 
searchforlocalsequencedigmnents(Ahshul.SJ.JMol.Evol.36290-300(1993)andAltschul 
SJ.etal. J. Mol. Biol. 21:403-410 (1990)). Alternatively a FASTA version 3 search against 

20 Genpept, using Fastxy algorithm. 

Species homologs (or orlhologs) of the disclosed polynucleotides and proteins are also 
provided by the present invention. Species homologs may be isolated and identified by making 
suitable probes or primers from the sequences provided herein and screening a suitable nufileic 

acid source from the desired species. 
25 The invention also encompasses allelic variants ofihe disclosed polynucleotides or 

proteins; that is, naturally-occurring alternative forms of the isolated polynucleotide which also 

encode proteins which are identical, homologous or related to that encoded by the 

polynucleotides. 

The nucleic acid sequences of the invention are fiirther duected to sequences which 
30 encode variants of tiie described nucleic adds. TTiese amino acid sequence variants may be 
prepared by methods known in the art by introducing appropriate nucleotide changes into a 
native or variant polynucleotide. Tliere are two variables in the construction of amino add 
sequence variants: the location of the mutation and the nature of the mutation Nuoldc acids 
encodmg the amino acid sequence variants are preferably constructed by mutating the 
35 polynudeotide to encode an amino add sequence that does not occur in nature. These nucleic 
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acid alterations can be made at sites that differ in the nticleic acids from different species 
(variable positions) or in highly conserved regions (constant regions). Sites at such locations 
wiU typicaUy be modified in series, e.g.. by substituting first with conservative choices (e.g., 
hydrophobic amino acid to a different hydrophobic amino acid) and then wilb more distant 
5 choices (e.g., hydrophobic amino acid to a charged amino acid), and then deletions orinsertions 
may be made at the target site. Amino acid sequence deletions generally range from about 1 to 
30 residues, preferably about 1 to 1 0 residues, and are typically contiguous. Amino acid 
insertions include amino- and/or carboxyl-tenninal fusions ranging in length firom one to one 
hundred or more residues, as weU as intrasequence insertions of single or multiple amino acid 
10 residues. Intrasequence insertions may range generally &om about 1 1o 10 amino residues, 

preferably from 1 to 5 residues. Examples of terminal insertions inchide the heterologous signal 
sequences necessary for secretion or for intracellular targeting in different host cells and 
sequences such as FLAG or poly-histidine sequences usefiil for purifying the expressed protein. 
In a preferred method, polynucleotides encodmg the novel amino acid sequences are 
15 changed via site-directed mutagenesis. This method uses oUgonudeotide sequences to alter a 
polynucleotide to encode the desired amino acid variant, as weU as sufficient adjacent 
nucleotides on both sides of the changed amino acid to form a stable duplex on dther side of the 
site of being changed. In general, the techniques of site-directed mutagenesis are vrell known to 
those of skill in the art and this technique is exemplified by pubUcations such as, Edehnan et al., 
20 DNA 2:183 (1983). A versatile and efficient method for producii« site-specific changes in a 
polynucleotide sequence v^ras published by ZoUer and Smith. Nucleic Acids Res. 10:6487-6500 . 
(1982). PGR may also be used to create amino acid sequence variants of the novel nucleic acids. 
When small amounts of template DNA are used as starting material, primer(s) that differs 
slightly in sequence from the corresponding region in the template DNA can generate the desired 
25 amino acid variant. PGR amplification results in a population of product DNA fragments that 
differ from the polynucleotide template encoding the polypeptide at the position specified by the 
primer. The product DNA fiagmeKtsrtqjlace the corresponding region in the plasmid and this 

gives a polynucleotide encoding the desired amino acid variant 

A fiirther technique for generating amino acid variants is the cassette mutagenesis 
30 technique described in Wells et al., Gene 34:315 (1985); and other mutagenesis techniques weU 
known in the art, such as, for example, the techniques in Sambrook et al., supra, and Current 
Protocols inMolecular Biology, Aasvbelttd. Due to Ihe inherent degeneracy ofthe genetic 
code, otiier DNA sequences which encode substantially tiie same or a fiinctionally equivalent 
amino acid sequence may be used in ti»e practice of tiie invention for the cloning and expression 
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• of these novel nucleic acids. Such DNA sequences include those which'are c^le of 
hybridizing to the appropriate novel nucleic acid sequence under stringent conditions. 

Polynucleotides encoding preferred polypeptide truncations of the invention can be used 
to generate polynucleotides encoding chimeric or fusion proteins comprising one or more 
5 domains ofthemvention and heterologous protein sequences. 

Hie polynucleotides of the invention additionally include the complement of any of the 
polynucleotides recited above. The polynucleotide can be DNA (genomic, cDNA, amplified, or 
synthetic) or RNA. Methods and algorithms for obtaining such polynucleotides are well knovm 
to those of skill in the art and can include, for example, methods for determining hybridization 
10 conditions that can routinely isolate polynucleotides of the desired sequence identities. 

In accordance with the mvention, polynucleotide sequences comprising the mature 
protein coding sequences corresponding to any one of SEQ ID NO: 1-1350, or functional 

equivalents thereoi^ may be used to generate recombinant DNA molecules that direct the 
expression of that nucleic acid, or a functional equivalent thereof in appropriate host cells. Also 

15 included are the cDNA inserts ofanyofthe clones identified herein. 

A polynucleotide according to the invention can be joined to any of a variety of other 
nucleotide sequences by well-established recombinant DNA techniques (see Sambrook J et al. 
(1989) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, NY). Usefiil 
nucleotide sequences for joinmg to polynucleotides include an assortment of vectors, e.g., 

20 plasmids, cosmids, lambda phage derivatives, phagemids, and the like, that are weU known in the 
art. Accordingly, the invention also provides a vector including a polynucleotide of the 
invention and a host cell containing the polynucleotide. In general, the vector contains an origin 
of replication functional in at least one organism, convenient restriction endonuclease sites, and a 
selectable marker for the host ceU. Vectors according to the invention include expression 

25 vectors, replication vectors, probe generation vectors, and sequencing vectors. A host ceU 
according to the invention can bb a prokaryotic or eukaryotic cell and can be a uniceUular 
organism or part of a multicellular organism. 

Hie present invention further provides recombmant constructs comprising a nucleic acid 
having any of the nucleotide sequences of SEQ ID NO:1-1350 or a fragment thereof or any other 

30 polynucleotides of the invention. In one embodiment, the recombinant constructs of the present 
invention comprise a vector, such as aplasmid or viral vector, mto which a nucleic acid having 
any of the nucleotide sequences of SEQ ID NO:1-1350 or a fiagment thereof is inserted, in a 
forward or reverse orientation. In the case of a vector comprising one of the ORFs of thcpresent 
invention, the vector may further comprise regulatory sequences, including for example, a 

35 promoter, operably linked to the ORF. Large numbers of suitable vectors and promoters are 
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10 



known to those of skiU in the art and aie commerdaUy avaUaWc for generating the recombinant 
constructs of the present invention. The foUowing vectors ate provided by way of example. 
Bacterial: pBs, phagescript, PsiX174, pBhiescript SK, pBs KS, pNHSa, pMI16a, pNH18a. 
pNH46a (Stiatagenc); pTrc99A. pKK223-3. pKK233.3, pDR540. pRJT5 (Pharmacia). 
Eukaryotic: pWLneo, pSV2cat, pOG44, PXTl, pSG (Stiatagene) pSVK3, pBPV, pMSG, pSVL 
(Pharmacia). 

The isolated polynucleotide of the invention may be operably linked to an expression 
control sequence such as the pMr2 or pED expression vectors disclosed in Kaufinan et al., 

Nucleic Acids Res. 19, 4485-4490 (1991). in order to produce the protein recombinantly. Many 
suitable expression control sequences are known in the art General methods of expressing 
recombinant proteins are also known and are exemplified in R. Kaufinan, Methods in 
Enzymology 185, 537-566 (1990). As defined herein "operably Imked" means that the isolated 
polynucleotide of the mvention and an expression control sequence are situated within a vector 
or cell in such a way that the protein is expressed by a host ceU which has been transformed 
15 (transfected) with the Ugatedpolynucleotide/expression control sequence. 

Promoter regions can be selected from any desired gene using CAT (chloramphenicol 
transferase) vectors or other vectors with selectable markeis. Two appropriate vectors are 
pKK232-8 and pCM7. Particular named bacterial promoters include laci, lacZ, T3, T7, gpt, 
lambda PR, and trc. Eukaryotic promoters include CMV immediate early, HSV thymidine 
20 kinase, early and late SV40, LTRs from retrovirus, and mouse metallothionem-I. Selection of 
the appropriate vector and promoter is well within the level of ordinaiy skill in the art. 
Generally, recombinant expression vectors will include origins of replication and selectable 
markers pemiitting transformation of the host cell, e.g., the ampiciUin resistance gene of £. coli 
■ and S. cerevisiae TRPl gene, and a promoter derived from a highly-expressed gene to direct 
25 transcription of a downstream structural sequence. Such promoters can be derived from operons 
encoding glycolytic enzymes such as 3-phosphoglycerate kinase (PGK), a-factor, acid 
phosphatase, or heat shock proteins, among others. The heterologous structural sequence is 
assembled in appropriate phase with translation initiation and termination sequences, and 
preferably, a leader sequence capable of directing secretion of translated protein into the 
30 periplasmic space or extracellular medium. OptionaUy, tiie heterologous sequence can encode a 
fiision protein including an amino teraninal identification peptide imparting desired 
characteristics, e.g., stabilization or simplified purification of expressed recombinant product. 
Useful expression vectors for bacterial use are constructed by inserting a stractuial DNA 
sequence encoding a desired protein together with suitable translation initiation and termination 
35 signals in operable reading phase witii a fimctional promoter. The vector wUl comprise one or 
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more phenotypic sdectable markers and an origin of replication to ensure maintenance of the 
vector and to, if desirable, provide amplification within the host Suitable prokaryotic hosts for 
transformation include E, colt. Bacillus subtilis. Salmonella typhimurium and various species 
wthin the genem Pseudomonas, Streptomyces, and Staphylococcus, although others may also be 

S employed as a matter of choice. 

As a representative but non-limiting example, useful expression vectors for bacterial use 
can comprise a selectable marker and bacterial origin of replication derived fix)m commercially 
available plasmids comprising genetic elements of the well known cloning vector pBR322 
(ATCC 37017). Such commercial vectors include, for example, pKK223-3 (Pharmacia Fine 

10 Chemicals, Uppsala, Sweden) and GEM 1 (Promega Biotech, Madison, WI, USA). These 
pBR322 "backbone" sections are combined with an appropriate promoter and the structural 
sequence to be e5q)iessed. Following transformation of a suitable host strain and growth of the 
host strain to an appropriate cell density, the selected promoter is induced or derepressed by 
appropriate means (e.g., temperature shift or chemical induction) and cells are cultured for an 

15 additional period. Cells are typically harvested by centrifugation, disrupted by physical or 
chemical means, and the resulting crude extract retained for furflier purification. . 

Polynucleotidpsofihe invention can also be used to induce immune responses. For 
example, as described in Fan et aL, Nat Biotech, 17:870-872 (1999), incorporated herein by 
reference, nucleic acid sequences encoding a polypeptide may be used to generate antibodies 

20 against flie encoded polypeptide following topical administration of naked plasmid DNA or 
following injection, and preferably intramuscular injection of flie DNA. The nucleic acid 
sequences are preferably inserted in a recombinant expression vector and may be in the form of 
naked DNA. 

25 4.3ANTISENSE 

Anotiier aspect of the mvention pertains to isolated antisense nucleic acid molecules tiiat 
are hybridizable to or complementary to tiic nucleic acid molecule comprising tiie nucleotide 
sequence of SEQ ID NO:1-1350, or fragments, analogs or derivatives thereof. An "antisense" 
nucleic acid comprises a nucleotide sequence that is complementary to a "sense" nucleic acid 
30 encoding a protein, complementary to die coding strand of a double-stranded cDNA 

molecule or complementary to an mRNA sequence. In specific aspects, antisense nucleic acid 
molecules are provided that comprise a sequence complementary to at least about 10, 25, 50, 
100, 250 or 500 nucleotides or an entire coding strand, or to only a portion thereof. Nucleic acid 
molecules encoding fiagments, homologs, derivatives and analogs of a protem of any of SEQ ID 
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NO:1351-2700 or antisense nucleic acids complementaiy to a nucleic acid sequence of SEQ ID 
NO:1-1350 are additionally provided. 

In one embodiment, an antisense nucleic acid molecule is antisense to a "coding region" 
of the coding strand of a nucleotide sequence of the invention. The tem "coding region" refers 
5 to the region of the nucleotide sequence comprising codons which are translated into amino add 
residues. In another embodiment, the antisense nucleic acid molecule is antisense to a 
"noncoding region" of the coding strand of a nucleotide sequence of the invention; The temi 
"noncoding region" refers to 5' and 3' sequences which flank the coding region that are not 
translated into amino acids (ic, also refened to as 5' and 3' untranslated regions). 
10 Given the coding strand sequences encoding a nucleic acid disclosed herein (eg., SEQ 

ID NO:1-1350), antisense nucleic adds of the invention can be designed according to the rules 
of Watson and Crick or Hoogsteen base pairing. The antisense nucldc acid molecule can be 
complementary to the entire coding region of a mRNA, but more preferably is an oUgonudeotide 
that is antisense to only a portion of the coding or noncoding region of a mRNA. For example, 
15 the antisense oUgonudeotide can be complementary to the region surrounding the translation 
start site of a mRNA. An antisense oUgonudeotide can be, for exan^le, about 5, 10, 15, 20, 25, 
30, 35, 40, 45 or 50 nucleotides in length. An antisense nucleic acid of the invention can be 
constructed using chemical synthesis or enzymatic Ugation reactions using procedures known in 
the art. For example, an antisense nucleic acid (e.g., an antisense oligonucleotide) can be 
20 chemically synthesized using naturaUy occurring nucleotides or variously modified nucleotides 
designed to increase the biological stability of the molecules or to increase Ihe physical stabiUty 
of die duplex formed between the antisense and sense nucleic acids, e.g., phosphorothioate 
derivatives and acridine substituted nucleotides can be used. 

Examples of modified nucleotides that can be used to generate the antisense nudeic acid 
25 include: S-fluorouracil, 5-bromouiacil, 5-dilorouracil, S-iodouracU, hypoxanthine, xanthine, 
4-acetylcytosine, 5-(caiboxyhydroxylmetiiyl) uradl, 5-caiboxymethylaminomethyI- 
2-thiouridine, 5-carboxymelhylaminomefliyluracil, dihydrouracil, beta-D-galactosylqueosine, 
inosme,N6-isopentenyladenine, 1-methylguanine, l-methyUnosine,2,2-dimethylguanine, 
2-methyladenine, 2-mefliylguanine, 3-mefliylcytosine, 5-methylcytosine, N6-adenine, 
30 7-mefliylguanine, 5-methylaminomethyluracil, 5-melhoxyaminomethyl-2-thiouracil, 
beta-D-mannosylqueosine, S'-methoxycarboxymethyluracil, 5-methoxyuracU, 
2-methyllhio-N6-isopentenyladenine, uracU-5-oxyaoetic acid (v), wybutoxosine, pseudouracU, 
queosine, 2-thiocytosine, 5.methyl-2-thiouraca, 2-thiouracU, 4-thiouiacU, 5-methyluracU, 
uracil.5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouraca, 
35 3-(3-amino-3-N-2.caiboxypropyl) uracil, (acp3)w, and 2,6.diaminopurine. Alternatively, the 
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antisense nucleic acid can be produced biologicaUy using an expression vector into which a 
nucleic acid has been subcloned in an antisense orientation (ie, RNA transcribed from the 
inserted nucleic acid wiU be of an antisense orientation to a target nuckic acid of interest, 
described further m the following subsection). 

5 The antisense nucleic acid molecules of the invention are typically administered to a 

subject or generated in situ such that tbey hybridize with or bind to cellular mRNA and/or 
genomic DNA encoding a protein according to the invention to thereby inhibit expression of the 
protein, e.g., by inhibiting transcription and/or translation. The hybridization can be by 
conventional nucleotide complementarity to form a stable duplex, or. for example, in the case of 

1 0 an antisense nucleic add molecule that binds to DNA duplexes, through specific interactions in 
the major groove of the double helix. An example of a route of administration of antisense 
nucleic acid molecules of the invention includes direct it^ection at a tissue site. Alternatively, 
antisense nucleic acid molecules can be modified to target selected cells and flien administered 
systemicaUy. For example, for systemic administration, antisense molecules can be modified 

15 such that they specifically bind to receptors or antigens expressed on a selected ceU surface, e.g., 
by linking the antisense nucleic add molecules to peptides or antibodies that bind to cell surface 
receptors or antigens. The antisense nucleic acid molecules can also be deUvered to cells using 
the vectors described herein. To achieve suffident intracellular concentrations of antisense 
molecules, vector constructs m whidi the antisense nucleic acid molecule is placed under the 

20 control ofa strong poin or polffl promote are preferred. 

In yet another embodiment, the antisense nucleic acid molecule of the invention is an 

: -a nomeric nucleic add molecule. An -a nomeric nucleic acid molecule fimns spedfic 
double-stranded hybrids with complementary RNA in which, contrary to the usual -units, the 
strands run parallel to each other (Gaultier et d. (1987) l^ucleic Acids Res 15: 6625-6641). The 
25 antisense nucldc acid molecule can also comprise a 2'-o-methylribonucleotide (Inoue et al. 

(1987) Nudeic Acids Res 15: 6131-6148) or a chimeric RNA -DNA analogue (Inoue et al. (1987) 
FEBSLett2l5: 327-330). 

4.4 WBOZYMES AND PNA MOIETIES 
30 In stai another embodiment, an antisense nucleic add of the invention is a ribozyme. 

Ribozymes are catalytic RNA molecules v«th ribonuclease activity that are capable of cleaving a. 

single-stranded nucleic acid, such as a mRNA, to which they have a complementary region. 

Thus, ribozymes {e.g.. hammerhead ribozymes (described in Haselhoff and Gerladi (1988) 

Nature 334:585-591)) can be used to catalytically cleave a mRNA transcripts to thereby inhibit 
35 translation of a mRNA. A ribozyme having spedfidty for a nucldc acid ofthe invention can be 
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designed based upon the nucleotide sequence of a DNA disclosed herein (.-.e.. SEQ ID N0:1- 
1350). For example, a derivative of a Tetrahymena L:19 IVS KNA can be constructed in which 
the nucleotide sequence of the active site is complementaiy to the nucleotide sequence to be 
cleavedinaSECX-encodingniRNA. See,eg..Cechefa/. U.S, Pat. No. 4.987.071; and Cecher 

5 al. U.S. Pat No. 5,1 16,742. Alternatively, SECX mRNA can be used to select a catalytic RNA 
having a specific ribonuclcase activily fix)m a pool of RNA molecules. See. e.g., Bartel et al.. 
(1993) Science 261:141 1-1418. 

Alternatively, gene expression can be inhibited by taigetmg nucleotide sequences 
complementary to the regulatory region (eg., promoter and/or enhancers) to form triple heUcal 

10 structuresthatpreventtranscriptionofthegeneintargetceUs. See generally, Helene. (1991) 
AnticmcerDmgDes^:569-M;meat.etal.{l992)Am.NXAcadSci660:27^^^^^ 

Maher (1992) Bioassays 14: 807-1 5. 

In various embodiments, the nucleic acids of the invention can be modified at the base 
moiety, sugar moiety or phosphate backbone to improve, ag., the stability, hybridization, or 

15 solubUity of the molecule. For example, the deoxyribose phosphate backbone of the nucleic 
acids can be modified to generate peptide micleic acids (see Hyrup et al (1 996) Bioorg Med 
Chem 4: 5-23). As used herein, the terms "peptide nucleic acids" or "PNAs" refer to nucleic acid 
mimics, eg, DNA mimics, in which the deoxyribose phosphate backbone is replaced by a 
pseudopeptide backbone and only the four natural nucleobases are retained. The neutral 

20 backbone of PNAs has been shown to allow for specific hybridization to DNA and RNA under 
conditions of low ionic strengtL The synthesis of PNA oUgomers can be performed using 
standard solid phase peptide synthesis protocols as described in Hyrup et al. (1996) above; 

Peny-O'Keefe etal. (1996) PNAS93: 14670-675. 

PNAsofthcinvendoncanbeusedintherapeuticanddiagnosticappUcations. For 

25 example. PNAs can be used as antisense or antigene agents for sequence^specific modulation of 
gene expression by, e.g., mducing transcription or translation arrest or inhibiting replication. 
PNAs of the invention can also be used, e.g., in the analysis of single base pair mutations in a 
gene by. e.g., PNA directed PGR clamping; as artificial restriction enzymes when used in 
combination with other en2ymes, e.g., SI nucleases (Hyrup B. (1996) above); or as probes or 

30 primers for DNA sequence and hybridization (Hyrup et al. (1996), above; Perry-O'Keefe (1996), 
above). 

In another embodiment, PNAs of the invention can be modified, e.g., to enhance their 
stabiUty or cellular uptake, by attaching lipophiUc or other helper groups to PNA. by the 
formation of PNA-DNA chimaas, or by the use of liposomes or other techniques of drug 
35 delivery known in the art. For example, PNA-DNA chimeras can be generated that may 
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combine the advantageous properties of PNA and DNA. Such chimeras allow DNA recognition 
enzymes, e.g., RNase H and DNA polymerases, to interact with the DNA portion while the PNA 
portion would provide high binding afBnity and specificity. PNA-DNA chimeras can be linked 
using linkers of appropriate lengths selected in terms of base stacking, number of bonds between 

5 the nucleobases, and orientation (Hyrup (1996) above). The synthesis of PNA-DNA chimeras 
canbeperfoimed as described mHyrup (1996) above and Finn era/. (1996) Nucl Adds Res 2A: 
3357-63. For example, a DNA chain can be synthesized on a solid support using standard 
phosphoramidite couplii^ chemistry, and modified nucleoside analogs, e.g., 
5'-(4-methoxytrityl)amino-5'-deoxy-thymidine phosphoramidite, can be used between the PNA 

10 and the 5' end of DNA (Mag et al (1989) Nud Acid Res 17: 5973-88). PNA monomers are then 
coupled in a stepwise manner to produce a chimeric molecule with a 5' PNA segment and a 3' 
DNA segment (Finn et al (1996) above). Alternatively, chuneric molecules can be synthesized 
with a 5' DNA segment and a 3' PNA segment See, Petersen et td. (1975) Bioorg Med Chem 
ieff 5: 1119-11124. 

15 In other embodiments, the oUgonucleotide may include other apptoded groups such as 

peptides (e.g., for targeting host ceU receptors in vivo), or agents faciUtating transport across the 
cell membrane (see, e.g., Letsinger et al, 1989, Proc. Natl Acad. Sci. U.S.A. 86:6553-6556; 
Lemaitre et al., 1987, Proc. Natl Acad Sd. 84:648-652; PCT PubUcation No. W088/09810) or 
the blood-brain barrier (see. e.g., PCT PubUcationNo. W089/10134). In addition. 

20 oligonucleotides can be modified with hybridization triggered cleavage agents (See, e.g., Krol et 
al, 1988, BioTechmques 6:958-976) or intercalaidng agents. (See, e.g., Zon, 1988, Pharm. Res. 
5: 539-549). To this end, the oligonucleotide may be conjugated to another molecule, e.g., a 
peptide, a hybridization triggered cross-linking agent, a transport agent, a hybridization-triggered 
cleavage t^ent, etc. 

25 

4.5 HOSTS 

The present invention further provides host cells geneticaUy engineered to contain the 
polynucleotides of the invention. For example, such host cells may contain nucleic adds of the 
invention introduced into the host cell using known transformation, transfection or infection 
30 methods. The present invention still further provides host cells geneticaUy engineered to express 
the polynucleotides of the invention, wherein such polynucleotides are in operative association 
with a regulatory sequence heterologous to the host ceU which drives expression of the 

polynucleotides in the cell. 

Knowledge of nucleic acid sequences allows for modification of cells to pemit, or 
35 increase, expression of endogenous polypeptide. Cells can be modified («.g., by homologous 
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recombmation) to provide increased polypeptide expression by replacing, in ^ole or in part, the 
naturaUy occurring promoter with all or part of a heterologom promoter so that the cells express 
the polypeptide at higher levels. The heterologous promoter is inserted in such a manner that it 
is opeiatively Unked to the encoding sequences. See. for example. PCT International Publication 

5 No. WO94/12650, PCT International PubUcation No. WO92/20808. and PCT International 
Publication No. WO91/09955. It is also contemplated that, in addition to heterologous promoter 
DNA, amplifiable marker DNA (e.g. , ada, dhfr, and the multifunctional CAD gene which 
encodes carbamyl phosphate synthase, aspartate transcarbamylase, and dihydioorotase) and/or 
introa DNA may be inserted along Avith the heterologous promoter DNA. If linked to the coding 

10 sequenccamplificationofthemaikerDNAbystandardselectionmethodsresultsinco- 

amplification of the desired protan codmg sequences in the ceUs., 

The host ceU can be a higher eukaiyotic host cell, such as a mammaUan ceU. a lower 
eukaryolic host cell, such as a yeast cell, or the host ceU can be a prokaiyotic cell, such as a 
bacterial ceU. Introduction of the recombinant construct into the host cell can be effected by 
15 calcium phosphate transfection. DEAE. dextran mediated transfection. or electroporation (Davis, 
L. et al., Basic Methods in Molecular Biology (1986)). The host cells containing one of the 
polynucleotides of flie invention, can be used in conventional manners to produce the gene 
productencodedbyti»eisolatedfragment(inthecaseofanORF)orcanbeusedtoproducea 

heterologous protein under tiie control of the EMF. 
20 Any host^ector system can be used to express one or more of the ORFs of tiie present 

invention. These include, but are not limitedto. eukaryotic hosts such as HeLa cells. Cv-1 cell, 
COS cells, 293 ceUs. and Sf9 ceUs. as weU as prokaryotic host such as E. coli and B. subtilis. 
TTie most prefened cells are those which do not normally express the particular polypeptide or 
protein or which expresses ti«. polypeptide or protein atlow natural level. Mature proteins can 

25 be expressed in mammalian cells, yeast, bacteria, or other ceUs under tiie control of appropriate 
promoters. CeU-fiee translation systems can also be employed to produce such proteins using 
RNAs derived ftom the DNA constructs of the present invention. Appropriate cloning and 
expression vectors for use witir prokaryotic and eukaryotic hosts are described by Sambrook, et 
al.. in Molecular Cloning: A Labor^ory Manual, Second Edition, Cold Spring Harbor, New 

30 York (1989), the disclosure of which is h«a»by incorporated by reference. 

Various mammaHan ceU culture systems can also be employed to express recombinant 
protein. Examples of mammalian expression systems include the CQS-7 lines of monkey kidney 
fibroblasts, described by Gluzman, Cell 23:175 (1981). Other ceU lines capable of expressing a 
compatible vector are, for example, the C127, monkey COS ceUs. Chinese Hamster Ovary 

35 (CHO) cells, human kidney 293 cells, human epidermal A431 ceUs, hmnan Colo205 ceUs, 3T3. 
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cells, CV-1 cells, otber transformed primate cell lines, nomial diploid cells, cell strains derived 
ftom in vitro culture of primary tissue, primary «q)lants, HeLa cells, mouse L ceUs, BHK, 
HL-60, U937, HaK or Jurkat cells. Mammalian expression vectors wiU comprise an origin of 
replication, a suitable promoter and also any necessary ribosome binding sites, polyadenylation 

5 site, splice donor and acceptor sites, transcriptional termination sequences, and 5' flanking 
nontranscribcd sequences. DNA sequences derived fiom the SV40 viral genome, for example, 
SV40 origin, early promoter, enhancer, spUce, and polyadenylation sites may be used to provide 
the required nontranscribed genetic elements. Recombinant polypeptides and proteins produced 
in bacterial culture are" usuaUy isolated by initial extraction fiom cell peUets. followed by one or 

10 more salting-out, aqueous ion exchange or size exclusion chromatography steps. Protein 
refolding steps can be used, as necessary, in completing configuration of the mature protein. 
FinaUy. high performance Uquid chromatography (HPLC) can be employed for final purification 
steps. Microbial ceUs employed in expression of protems can be disrupted by any convenient 
method, including fieeze-thaw cycling, sonication, mechanical disruption, or use of cell lysing 

15 agents. 

Alternatively, it may be possible to produce the protem m lower eukaryotes such as yeast 
or insects or in prokaryotes such as bacteria. Potentially suitable yeast strains include 
Saccharomyces cerevisiae, Schizosaccharomyces pombe, Kluyveromyces strains, Candida, or 
any yeast stram capable of expressing heterologous proteins. PotentiaUy suitable bacterial 

20 strains include Escherichia coli. Bacillus subtilis, Salmonella typhimurium, or any bacterial 
strain capable of expressing heterologous proteins. Ifthe protein is made in yeast or bacteria, it 
may be necessary to modify the proton produced therein, for example by phosphoiylation or 
glycosylationofthe appropriate sites, m order to obtain the functional protein. Such covalent 
attachments may be accomplished using known chemical or enzymatic methods. 

25 to another embodiment of the present invention. ceUs and tissues may be engineered to 

express an endogenous gene comprismg the polynucleotides of the mvention under the control of 
inducible regulatory elements, in which case the regulatory sequences of the endogenous gene 
. maybereplacedby homologousrecombination. As described herem, gene targeting can be used 
to replace a gene's existing regulatory region with a regulatory sequence isolated from a different 

30 gene or a novel regulatory sequence synthesized by genetic engineering methods. Such 

regulatory sequences may be comprised of prompters, enhancers, scaffold-attachment regions, 
negative regulatory elements, transcriptional initiation sites, regulatory protein binding sites or 
combinations of said sequences. Alternatively, sequences which affect the stnacture or stabiUty 
of the RNA or protein produced may be replaced, removed, added, or otherwise modified by 

35 targeting. These sequence include polyadenylation signals. mRNA stabiHty elements. spUce 
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sites, leader sequences for enhancing or modifying transport or secretion properties of the 
protiin, or other sequences which alter or improve the ftnction or stability of piotein or RNA 
molecules. 

The targeting event may be a simple insertion of the regulatoiy sequence, placing the 
gene under the control of the new regulatory sequence, e.g.. inserting a new promoter or 
enhancer or both upstream of a gene. Alternatively, the targeting event may be a simple deletion 
of a regulatoiy element, such as the deletion of atissue-spedfic negative regulatory element. 
Alternatively, the targeting event may replace an existing element; for example, a tissue-specific 
enhancer can be replaced by an enhancer that has broader or different cell-type specificity than 
the naturaUy occurring elements. Here, the naturally occuning sequences are deleted and new 
sequences are added. In all cases, the identification of the targeting event may be facilitated by 
the use of one or more selectable marker genes that are contiguous with the targeting DNA. 
allowing for the selection of cells in which Ihe exogenous DNA has integrated into the host.cell 
genome. The identification of the targeting event may also be fecilitated by the use of one or 
more marker genes exhibiting the property of negative selection, such that the negatively 
selectable marker is linked to ihe exogenous DNA, but configured such that the negatively 
selectable marker flanks the targeting sequence, and such that a coirect homologous 
recombination event with sequences in the host cell genome does not result in the stable 
integration of the negatively selectable marker. Markers usefid for this purpose include the 
Herpes Simplex Virus thymidine kinase (TK) gene or the bacterial xanthine-guanine 
phosphoribosyi-transferase (gpt) gene. 

TTie gene tai^eting or gene activation techniques which can be used in accordance with 
this aspect of the invention are more particularly described m U.S. Patent No. 5,272,071 to 
Chappel; U.S. Patent No. 5.578,461 to Sherwin et al.; International Application No. 
PCT/US92/09627 (WO93/09222) by Selden et aL; and International Application No. 
PCT/US90/06436 (WO91/06667) by Skoultchi et al'., each of which is incorporated by reference 
herein in its entirety. 

4.6 POLYPEPTTOES OF THE INVENTION 

Tlie isolated polypeptides of the invention include, but are not limited to, a polypeptide 
comprising: the amino acid sequences set forth as any one of SEQ ID NO:1351-2700 or an 
amino acid sequence encoded by any one of the nucleotide sequences SEQ ID NO: 1-1350 or the 
corresponding fiill length or mature protein. Polypeptides of the invention also include 
polypeptides preferably with biological or immunological activity that are encoded by: (a) a 
polynucleotide having any one of the nucleotide sequences set forth in SEQ ID NO:1-1350 or (b) 



27 



wo 01/57188 



PCTAJSOl/03800 



polynucleotides encoding any one of the amino acid sequences set forlli as SEQ ID NO:1351- 
2700 or (c) polynucleotides that hybridize to the complement of the polynucleotides of either (a) 
or (b) under stringent hybridization conditions. The invention also provides Wologically active 
or immunologically active variants of any of the amino acid sequences set forth as SEQ ID 

5 NO:1351-2700 or the conesponding faU length or mature protein; and "substantial equivalents" 
thereof (e.g., witii at least about 65%. at least about 70%, at least about 75%, at least about 80%, 
at least about 85%, 86%, 87%, 88%, 89%, at least about 90%. 91%. 92%. 93%, 94%, typicaUy at 
least about 95%, 96%, 97%, more typically at least about 98%, or most typicaUy at least about 
99% amino acid identity) that retain biological activity. Polypeptides encodedby alleUc variants 

10 may have a similar, increased, or decreased activity compared to polypeptides comprising SEQ 
IDNO:1351-2700. 

Fragments of the proteins of the present invention which are capable of exhibiting 
biological activity are also encompassed by the present invention. Fragments of the protein may 
be m linear form or they may be cyclized using known methods, for example, as described in H. 

15 U. Saragovi, et al.. BioA-echnology 10, 773-778 (1992) and in R. S. McDowell, et al., J. Amer. 
Chem. Soc. 1 14, 9245-9253 (1992), both of which are incorporated herein by reference. Such 
j&agments may be fused to carrier molecules such as immunoglobulms for many purposes, 
including increasing the valency of protein bmding sites. 

The present invention also provides both full-length and mature forms (for example, 

20 without a signal sequence or precursor sequence) of the disclosed proteins. The protein coding 
sequence is identified in the sequence listing by tisnslation of tiie disclosed nucleotide 
sequences. The mature fom of such protein may be obtained by expression of a fall-length 
polynucleotide in a suitable mammalian cell or other host cell. Ths sequence of the mature form 
of tiie protein is also deteraunable from the amino acid sequence of the fuU-length form. Where 

25 proteins of 1lie present invention are membrane bound, soluble forms of tiie proteins are also 
provided. In such forms, part or aU of the regions causing the proteins to be membrane bound 
are deleted so that the proteins arc fully secreted from the cell in which they are expressed. 

Protein compositions of tiie present invention may further comprise an acceptable carrier, 
such as a hydrophilic, e.g., phaimaceutically acceptable, carrier. 

30 The present invention fiirther provides isolated polypeptides encoded by tiie nucleic acid 

fragments of tiie present invention or by degenerate variants of tiie nucleic acid fragments of tiie 
present invention. By "degeuerate variant" is intended nucleotide fragments which differ from a 
nucleic acid fragment of tiie present invention (e.g., an ORF) by nucleotide sequence but, due to 
tiie degeneracy of tiie genetic code, encode an identical polypeptide sequence. Preferred nucleic 

35 acid fragments of flie present invention are tiie ORFs tiiat encode proteins. 
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A variety of methodologies known in the art can be utilized to obtain any one of the 
isolated polypeptides or proteins of the present invention. At the simplest level, the amino add 
sequence can be synthesized using commerciaUy available peptide synthesizers. The 
synthetically-constructed protein sequences, by virtue of sharing primary, secondary or tertiary 
5 structural and/or conformational characteristics with proteins may possess biologiml properties 
in common therewith, including protein activity. This technique is particularly useful in 
producing small peptides and fragments of larger polypeptides. Fragments are useful, for 
example, in generating antibodies against the native polypeptide. Tims, they may be employed 
as biologically active or immunological substitutes for natural, purified proteins in screening of 
10 therapeutic compounds and in immunological processes for the development of antibodies. 

The polypeptides and proteins of the present invention can alternatively be purified from 
cells which have been altered to express the desired polypeptide or protem. As usfcd herein, a 
cell is said to be altered to express a desired polypeptide or protein when the cell, through genetic 
manipulation, is made to produce a polypeptide or protein which it normally does not produce or 
15 which the ceU normally produces at a lower level. One skilled in the art can readUy adapt 
procedures for introducing and escpressmg either recombinant or synthetic sequences into 
eukaryotic or prokaiyotic cells in order to generate a ceU which produces one of the polypeptide 
or proteins of titie present invention. 

The invention also relates to methods for producing a polypeptide comprising growing a 
20 culture of host cells of the invention in a suitable culnire medium, and purifying the protein from 
the cells or the culture m which the cells are grown. For example, the methods of the invention 
include a process for producing a polypeptide in which a host cell containing a suitable 
expression vector that includes a polynucleotide of the invention is cultured under conditions that 
aUow expression of the encoded polypeptide. The polypeptide can be recovered from the 
25 culture, conveniently from the culture medium, or from a lysate prepared from the host cells and 
further purified. Prefetred embodiments include those m which the protein produced by such 
process is a full length or mature form of the protein. 

In an alternative method, the polypeptide or protein is purified from bacterial cells which 
naturally produce the polypeptide or protein. One skilled in the art can readily follow known 
30 methods for isolating polypeptides and proteins in order to obtain one of the isolated 
polypeptides or protems of the present invention. These include, but are not limited to, 
unmunochromatography, HPLC. size-exclusion chromatography, ion-exchange chromatography, 
and immuno-afBnity chromatography. See, e.g., Scopes, Protein Purification: Principles and 
Practice. Springer-Verlag (1994); Sambrook, et al., in Molecular Cloning: A Laboratory 
35 Manual; Ausubel et al.. Current Protocols in Molecular Biology. Polypeptide fragments that 
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retain biological/immunological activity include fragments comprising greater than about 100 
amino acids, or greater than about 200 amino acids, and fiaginents that encode specific protein 
domains. 

The purified polypeptides can be used in in vitro binding assays which are well known in 
5 the art to identify molecules which bind to the polypeptides. TTiese molecules include but are not 
limited to, for e.g., small molecules, molecules from combinatorial libraries, antibodies or other 
proteins. The molecules identified in the binding assay are then tested for antagonist or agonist 
activity in in vivo tissue culture or animal models that are well known in the art In brief, the 
molecules are titrated into a plurality of ceU cultures or animals and then tested for either 
10 cell/animal death of prolonged survival of the animal/cells. 

In addition, the peptides of the invention or molecules capable of binding to Has peptides 
may be complexed with toxins, e.g., ricin or cholera, or with other compounds that are toxic to 
cells. The toxin-binding molecule complex is then targeted to a tumor or other ceU by the 
specifidty of the binding molecule for SEQ ID NO:1351-2700. 
15 The proteiri of the invention may also be e5q»ressed as a product of transgenic animals, 

e.g.. as a component of the milk of transgenic cows, goats, pigs, or sheep which are characterized 
by somatic or germ cells containing a nucleotide sequence encoding the protein. 

The proteins provided herein also include proteins characterized by amino acid sequences 
sinular to those of purified proteins but into ^siiich modification are naturally provided or 
20 deliberately engineered. For example, modifications, in tiie peptide or DNA sequence, can be 
made by tiiose skilled in the art using known techniques. Modifications of interest in tiie protein 
sequences.may include the alteration, substitution, replacement, insertion or deletion of a 
selected amino add residue in the coding sequence. For example, one or more of the cysteme 
residues may be deleted or replaced with another amino acid to alter die confomiation of the 
25 molecule. Techniques for such alteration, substitution, replacement, insertion or deletion are 
well known to tiiose skilled mthe art (see, e.g.. U.S. Pat. No. 4,518,584). Preferably, such 
alteration, substitution, replacement, insertion or deletion retains the desired activity of tiie 
protein. Regions of the protein that are important for the protein function can be determmed by 
various metiiods known in die art including tiie alanine-scanning metiiod which involved 
30 systematic substitution of siagle or stiings of amino acids with alanine, foUowed by testing tiie 
resulting alanine-containing variant for biological activity. This type of analysis determmes tiie 
importance of tiie substituted amino acid(s) in biological activity. Regions of tiie protein tiiat are 
important for protein fimction may be determined by die eMATRIX program. 

Otiier fragments and derivatives of tiie sequences of proteins which would be expected to 
35 retain protein activity in whole or in part and are usefiil for screenmg or oflier immunological 
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methodologies may also be easUy made by those skilled in the art given the disclosures herein. 
Such modifications are encompassed by the present invention. 

The protein may also be produced by operably linking the isolated polynucleotide of the 
invention to suitable control sequences in one or more insect expression vectors, and employing 
5 an insect expression system. Materials and methods for baculoviruatosect cell ejqaession 
systems are commercially avaUable in kit form fiom, e.g.> Invittogen, San Diego, Calif., U.S.A. 
(the MaxBat™ kit), and such methods are well known in the art, as described in Summers and 
Smith, Texas Agricultural Experiment Station BuUetinNo. 1555 (1987), incoiporated herein by 
reference. As used hei«in, an insect cell capable of expressing a polynucleotide of the presort 
10 invention is "transformed." 

The protein of the invention may be prepared by cultuiing tiansfonned host cells under 
culture conditions suitable to expKss the recombinant protein. The resulting expressed protein 
may then be purified from such culture (ie., from cultare medium or ceU extracts) using known 
purification processes, such as gel filtration and ion exchange chromatography. The purification 
15 of the protein may also inchide an afOnity column containing agents \*iiidi will bind to the 
protein; one or more column steps over such aiEBnity resins as concanavalin A-agarose, 
heparinrtoyopearr™ or Cibacrom blue 3GA Sepharose™; one or more steps involving 
hydrophobic interaction chromatography using such resins as phenyl ether, butyl ether, or propyl 
ether; or immunoafGnity chromatography. 
20 Altemadvely, the protein of the invention may also be expressed in a form which will 

feciUtate purification. For example, it may be expressed as a fusion protein, such as those of •. 
maltose binding protein (MBP), glutathione-S-transfeiase (GST) or thioredoxin (TRX), or as a 
His tag. Kits for e:q)ression and purification of such fusion proteins are commercially available 
from New England BioUb (Beverly, Mass.), Pharmacia (Piscataway, N.J.) and Invitrogen, 
25 respectively. The prolBin can also be tagged with an epitope and subsequently purified by using 
a specific antibody directed to such epitope. One such epitope ('TLAG®") is commerciaUy 
available from Kodak (New Haven, Conn.). 

Finally, one or more reverse-phase high performance liquid chromatography (RP- HPLC) 
steps employing hydrophobic RP-HPLC media, e.g., silica gel having pendant methyl or other 
30 aliphatic groups, can be employed to further purify the protein. Some or aU of the foregoing 
purification steps, in various combinations, can also be employed to provide a substantially 
homogeneous isolated recombinant protein. The protein thus purified is substantiaUy free of 
other mammalian proteins and is defined in accordance with the present invention as an "isolated 
protein." 
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Hie polypeptides of the invention include analogs (variants). This embraces ftagments, 
as weU as peptides in which one or more amino acids has been deleted, inserted, or substituted. 
Also, analogs of Ihe polypeptides of the invention embrace fusions of the polypeptides or 
modifications of the polypeptides of Ihe invention, wherein the polypeptide or analog is fused to 

5 another moiety or moieties, e.g., targeting moiety or another therapeutic agent. Such analogs 
may exhibit improved properties such as activity and/or stabiUty. Examples of moieties which 
may be fused to the polypeptide or an analog include, for example, targeting moieties which 
provide for the deUvery of polypeptide to pancreatic ceUs, e.g., antibodies to pancreatic cells, 
antibodies to immune cells such as T-cells, monocytes, dendritic cells, granulocytes, etc., as well 

10 as receptor and ligands expressed on pancreatic or faranune cells. Oflier moieties which may be 
fused to tiie polypeptide include therapeutic agents which are used for treatment, for example, 
immimosuppiessive drugs such as cyclosporin, SK506, azatiiioprme, CD3 antibodies and 
steroids. Also, polypeptides may be fused to immune modulators, and oflier cytokines such as 
alpha or beta interferon. 

15 

4.6.1 DETERMINING POLYPEPTIDE AND POLYNUCLEOTTDE JDENTTTY 

ANDSIMILABITY 

Preferred identity and/or similarity are designed to give tiie largest match between tiie 

sequences tested. Methods to determine identity and similarity are codified in computer 
20 programs includmg, but are not limited to, the GCG program package, including GAP 

(Devereux, J., et al., Nucleic Acids Research 12(1):387 (1984); Genetics Computer Group. 

University of Wisconsin, Madison. WI). BLASTP, BLASTN, BLASTX, FASTA (Altschul, S.F. 

et al., J. Molec. Biol. 215:403-410 (1990), PSI-BLAST (Altschul S.F. et al.. Nucleic Acids Res. 

vol. 25, pp. 3389-3402, herein incorporated by reference), eMatrix software (Wu et al., J. Comp. 
25 Biol., Vol. 6, pp. 219-235 (1999), herein incorporated by reference), eMotif software (NeviU- 

Mamiing et al, ISMB.97, Vol. 4, pp. 202-209, herein incorporated by reference), pFam software 

(Somjhammer et al., Nucleic Acids Res.. Vol. 26(1), pp. 320-322 (1998). herein incorporated by 

reference) andtiie Kyte-Doolitfle hydrophobocity prediction algorithm (J. Mol Biol, 157, pp. 

105-31 (1982), incorporated herem by reference). The BLAST programs are pubUcly available 
30 from flie National Center for Biotechnology Information (NCBI) and other sources (BLAST 

Manual, Altschul, S., et al. NCB NLM NIH Bethesda, MD 20894; Altschul, S., et al., J. Mol. 

BioL 215:403-410 (1990). . 

4.7 CHIMERIC AND FUSION PROTEINS 

The invention also provides chimeric or fusion proteins. As used herein, a "chimeric 
35 protein" or "fusion protein" comprises a polypeptide of tiie invention operatively Imked to 
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another polypeptide. Wlhin a fosion protem the polypeptide according to the mveation can 
correspond to aU or a portion of a protein according to the invention. In one embodunent, a 
fusion protein comprises at least onebiologicaUyactiveportionofaprotein accord 

invention. In another embodiment, a fusion protem comprises at least two biologically active 
5 portionsofa protein according to the mvention-Withm the fusion protein, the term "o^^^^ 
Unked'-isintendedtoindicatethatthepolypeptideaccordingtothemvention andtheother • 
polypeptide are fused in-fiame to each other. The polypeptide can be fused to the N-tciminus or 

C-terminus. 

For example, in one embodiment a fusion protem comprises a polypeptide according to 
10 the invention operably linked to the extracellular domam of a second protein. 

In another embodiment, the fusion protein isaGST-fusion protein in .vhich the 

polypeptide sequences of the invention are fused to the C-terminus of the GST (f. e. , glutathione 

S-transferase) sequences. 

In another embodiment, the fusion protein is an immmioglobulin fusion protein m which 
1 5 the polypeptide sequences according to the invention comprises one or more domains are fused 
tosequencesderivedfiomamemberoftheimmimoglobulinproteinfemUy. The 
immunoglobulin fusion protems of the invention can be incorporated mto pharmaceutical 
compositions and administered to a subject to mhibit an interaction between a Ugand and a 
protein of the invention on the surface of a cell, to thereby suppress signal transduction in vrvo. 
20 The innnunoglobulmfusionprol^ms canbeusedto affect the bioavailability of a cognate bgand. 
tohibition of the Ugand/protein interaction may be use&l therapeutically for both the treatment of 
proliferative and differentiative disorders. e,g.. cancer as well as modulating {e.g., promotmg or 
inhibiting) ceU survival. Moreover, the hmnunoglobuUn fusion protems oftiie invention can be 
used as immunogens to produce antibodies inasubject,t« purify Ugands.andinscree^^ 
25 toidentifymolecdesthatmMHttheinteractionofapolypeptideoftheinventionw^^ 

Achimeric or fusion protem of the invention can be produced by standard recombmant 

DNA techniques. For example. DNA fragments codmg for the different polypeptide sequences 
are Ugated together m-fiame in accordance with conventional techniques. e.g. , by employing 
blunt-ended or stagger^ded termini for ligation, restriction enzyme digestion to provide for 

30 appropriate termmi..fillmg-in of cohesive ends as appropriate, alkalme phosphatase treatment to 
avoidundesirablejoining, and enzymatic Ugation. to anotiier embodiment, the fusion gene can 
be synthesized by conventional techniques mcluding automated DNA syntiiesizers. 
Alternatively. ?CR amplification of gene fragments can be carried out usmg anchor primers that 
give rise to complementary overhangs between two consecutive gene fragments tiiat can 

35 subsequentiybeannealedandreamplifiedtogenerateacbimericgenesequence(see.for 
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example. Ausubel et al. (eds.) CuiwEOT Protocols in Molecular Biology, John WUey & 
Sons, 1992). Moreover, many expression vectors are commercially available that already encode 
a fusion moiety (e.g., a GST polypeptide). A nucleic acid encoding a polypeptide of the 
invention can be cloned into such an expression vector such that the fusion moiely is linked 
5 in-fiame to the protein of the invention. 

4^ GENE THERAPY 

Mutations in the polynucleotides of the invention gene may result in loss of normal 
function of the encoded protein. The invention thus provides gene therapy to restore normal 
10 activity of the polypeptides of the invention; or to treat disease states involvmg polypeptides of 
the invention. DeUvery of a functional gene encoding polypeptides of the invention to 
appropriate cells is effected ex vivo, in situ, or in vivo by use of vectors, and more particularly 
viral vectors (eg., adenovirus, adeno-associated virus, or a retrovirus), or ex vivo by use of 
physical DNA transfer methods (e.g.. Uposomes or chemical treatments). See. for example, 
15 Anderson, Nature, supplement to vol. 392, no. 6679. pp.25-20 (1998). For additional reviews of 
gene therapy technology see Friedmann, Science, 244: 1275-1281 (1989); Vemia,,Scientific 
American: 68-84 (1990); and MiUer, Nature, 357: 455-460 (1992). Introduction of any one of 
the nucleotides of tiie present invention or a gene encoding the polypeptides of the present 
invention can also be accomplished with extrachromosomal substrates (transient expression) or 
20 artificial chromosomes (stable expression). Cells may also be cultured ex vivo in the presence of 
proteiQS of tiie present invention in order to proliferate or to produce a desired effect on or 
activity in such cells. Treated cells can then be introduced in vivo for therapeutic purposes. 
Alternatively, it is contemplated that in other human disease states, preventing tiie expression of 
or inhibiting die activity of polypeptides of the invention wiU be usefiil in treating the disease 
25 states. It is contemplated tiiat antisense tiierapy or gene tiierapy could be appUed to negatively 
regulate tire expression of polypeptides of tiie invention. 

Other metiiods inhibiting expressionof a protein mclude flie introductionof antisense 
molecules to the nucleic acids of tiie present invention,tiieir complements, or tiieir translated RNA 
sequences,bymetiiodsknownintheart. Further, tiie polypeptides ofthe present invention can be 
30 uihibitedby using targeted deletion metiiods, or tiie insertion of a negative regulatory element such 
as a silencer, which is tissue specific. 

The present invention still fiirther provides cells geneticaUy engineered in vivo to expiesstiie 
polynucleotidesof tiie invention, wherein such polynucleotides are in operative association witii a 
regulatory sequence heterologousto tiie host ceU which drives expressionof the polynucleotides in 
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thecelL Thesemethods can be used to kcrease or decrease iheexpresaon of the polynucleotides of 
the present invention. 

Knowledgeof DNA sequencesprovidedby the invention allows formodificationof cells to 
permit, increase, or decrease, expression of endogenous polypeptide. Cells can be modified (e.g., by 
5 homologousrecombinadon)toprovideincreasedpolypeptidee3q)ressionbyrq)lacing,inwholeor 
in part, Ihe naturally occurring promoter with all or part of a heterologous promoter so that the cells 
express the protein at higher leveb. The heterologouspromoteris inserted in such a manner that it is 
operatively linked to the desired proteinencodiug sequences. See, for example, PCT Inteniational 
PublicationNo. WO 94/12650, PCT International PubUcationNo. WO 92/20808, and PCT 
10 IntemationalPiibUcationNo. WO 91/09955.1tis also contemplatedthat, in addition to heterologous 
promoter DNA. amplifiablemarkerDNA(e.f., ada, dhfr, andthe multifimctionalCAD gene which 
encodes caibamyl phosphate synthase, aspartate transcarbamylase, and dihydroorotase) and/or 
intron DNA may be inserted along with tiie heterologous promoter DNA. If linked to the desired 
protein coding sequence, amplification of the marker DNA by standard selectionmelhods results in 
15 co-amplificationoffhedesiiedproteincodingsequencesin the cells. 

In another embodiment of the presefet invention, cells and tissues may be engineered to 
express an endogenous gene comprising the polynucleotides ofthe invention under the control of 
inducible regulatory elements, m which case the regulatory sequences of the endogenous gene may 
be rq)lacedby homologous recombination. As described heran, genetai^eting can be used to 
20 replace a gene's existing regulatory region with a regulatory sequence isolated from a different gene 
or a novel regulatory sequence synthesized by genetic engineering methods. Such regulatory 
sequences may be comprised of promoters, enhancers, scaffold-attachment regions, negative 
regulatory elements, transraiptional initiation sites, regulatory proteinbinding sites or combinations 
of said sequences. Alternatively, sequences which affect the structure or stability oftheRNA or 
25 proteinproduced may be replaced, removed, added, or otherwise modifiedby targeting. Hiese 
sequences include polyadenylationsignals, mRNA stabiUty elements, splice sites, leader sequences 
for enhancing ormodifjring transport or secretionproperties of the protein, or other sequences 
■which alter or unprove the fimction or stability of protem or RNA molecules. 

The targeting event may be a simple insertionof tiie regulatory sequence, placing the gene 
30 underthecontrolofthe new regulatory sequence, e.g., inserting a new promoter or enhancer or both 
upstream of a gene. Alternatively, the targeting event may be a simple deletion of a regulatory 
element, such as the deletion of a tissue-specific negative regulatory element. Alternatively, tiie 
targeting event may replace an existing element; for example, a tissue-specific enhancer can be 
replaced by an enhancer tiiat has broader or different ceU-type specificity tiian tiie naturally 
35 occuningdements. Here, the naturally occurring sequences are deleted and new sequences are 
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added. In aU cases, the identification of the targeting event may be fedlitatedby the use of one or 
more selectable marker genes that are contiguous with the targeting DNA, allowingfor the selection 
of ceUs in whichthe exogenousDNA has integrated into Ihe ceU genome. The identificationof the 
targeting event may also be feciUtated by the use of one or more marker genes ejduHting Ihe 

5 property of negative selection, such that the negatively selectablemarker is linked to the exogenous 
DNA, but configured such that the negatively selectable maricer flanks the targeting sequence, and 
such ftat a correct homologous recombination event with sequences in the host cell genome does 
notresultin the stable integrationofthenegativelyselectablemarker. Markers useful for this 
purposeincludeflwHetpesSimplex Virus thymidmekinaseCrK.) gene or the bacterial 

10 xanlhin&-guaninephosphoribosyl-transferase(gpt)gene. 

The gene targeting or gene activationtechniques which can be used m accordance witii this 
aspectof lhe invention are more particularly described in U.S. PatentNo. 5,272,071 to Chappd; 
• U.S. PatentNo. 5,578,461 to Sherwin et al.; International AppUcationNo. PCT/US92/09627 
(WO93/09222)by Seldenet al.; and IntemationalApplicationNo. PCT/US90/06436 

15 (WO91/06667)by Skoultchietal., each of which is incorporated by reference heasdn in its oitirety. 

4S TRANSGENIC ANIMALS 

In preferred methods to determine biological functions of the polypeptides of the 
invention in vivo, one or more genes provided by the invention are either over expressed or 

20 inactivated in the line of animals using homologous recombination [Capecchi, Science 
244:1288-1292 (1989)]. Animals m which the gene is over ejqiressed, under the regulatory 
control of exogenous or endogenous promoter elements, are known as transgenic animals. 
Animals m which an endogenous gene has been inactivated by homologous recombination are 
referred to as "knockout" ammals. Knockout animals, preferably non-human mammals, can be 

25 prepared as described in U.S. PatentNo. 5,557,032, incorporated herein by reference. Transgenic 
animals are useful to determine the roles polypeptides of the invention play in biological 
processes, and preferably in disease stales. Transgenic animals are useful as model systems to 
identify compounds that modulate lipid metaboUsm. Transgenic animals, preferably non-human 
mammals, are produced using melhods as described in U.S. PatentNo 5,489,743 and PCX 

30 Publication No. W094/28122, moorporated herein by reference. 

Transgenic animals can be prepared wherein all or part of a promoter of the 
polynucleotides of the invention is either activated or mactivated to alter the level of expression 
of the polypeptides of die invention. Inactivation can be carried out using homologous 
recombmation methods described above. Activation can be achieved by supplementing or even 

35 replacing the homologous promoter to provide for increased protein ejqjression. llie homologous 
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promoter can be supplemented by insertion of one or more heterologous enhancer elements 
known to confer promoter activation in a particular tissue. 

The polynucleotides of the present invention also make possible the development, 
through, e.g. , homologous recombination or knock out strategies, of animals that faD to e^qaess 
5 polypejtidesoftheinventionorthatexpressavariantpolypeptide. Such animals are useful as 
models for studying the in vivo activities of polypeptide as weU as for studying modulators of the 
polypeptides of the invention. 

In preferred methods to determine biological functions of the polypeptides of the 
invention in vivo, one or more genes provided by the mvention are either over expressed or 
10 inactivated in Ihe germ line of animals using homologous recombination [C^hi, Science 
244:1288-1292 (1989)]. Animals in which the gene is over expressed, under Ihe regulatoiy 
control of exogenous or endogenous promoter elements, are known as liansgenic animals. 
Animals in A^ch an endogenous gene has been inactivated by homologous recombination are 
referred to as "knockouf ammals. Knockout animals, preferably non-human mammals, can be 
15 prepared as described in U.S. Patent No. 5,557.032, incorporated herein by reference. Transgenic 
animals are useful to determme the roles polypeptides of the invention play in biological 
processes, and preferably in disease states. Transgenic animals are useful as model systems to 
identify compounds that modulate Upid metabolism. Transgenic animals, preferably non-human 
mammals, are produced using methods as described in U.S. Patent No 5,489,743 and PCT 
20 Publication No. W094/28122. incorporated herein by reference. 

Transgenic animals can be prepared wherein all or part of tiie polynucleotides of tiie 
invention promoter is either activated or inactivated to alter the level of expression of the 
polypeptides of the invention. Inactivation can be carried out using homologous recombination 
methods described above. Activation can be achieved by supplementing or even replacing the 
25 homologous promoter to provide for increased protein expression. The homologous promoter 
can be supplemented by insertion of one or more heterologous enhancer elements known to 
confer promoter activation in a particular tissue. 

4.10 USES AND BIOLOGICAL ACTIVITY 

30 The polynucleotides and prbtcins of tiie present mvention are expected to exhibit one or 

more of tiie uses or biological activities (including tiiose associated with assays cited herein) 
identified herein. Uses or activities described for proteins of the present invention may be 
provided by administration or use of such proteins or of polynucleotides encoding such proteins 
(such as, for example, in gene tiierapies or vectors suitable for introduction of DNA). The 

35 mechanism underlying tiie particular condition or patiiology will dictate whether tiie 
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polypeptides of the invention, the polynucleotides of the invention or modulators (activators or 
inhibitors) thereof would be beneficial to the subject in need of treatment Thus, "therapeutic 
compositions of the invention" include compositions comprising isolated polynucleotides 
(including recombinant DNA molecules, cloned genes and degenerate variants thereof) or 

5 polypeptides of the invention (including fuU length protein, mature protein and truncations or 
domains thereof), or compounds and other substances that modulate the overaU activity of the 
target gene products, either at the level of target gene/protein expression or target protein 
activity. Such modulatras include polypeptides, analogs, (variants), including ftagments and 
fusion proteins, antibodies and other binding proteins; chemical compounds that directly or 

10 indirectly activate or inhibit thepolypeptides of the invention Cidentified, e.g., via drug screening 
assays as described herem); antisense polynucleotides and polynucleotides suitable for triple 
helbc formation; and in particular antibodies or other bmdmg partners that specifically recogni2ie 
one or more epitopes of the polypeptides of fee invention. 

The polypeptides of the present mvention may likewise be involved in cellular activation 

15 or in one of the other physiological pathways desaibed herein. 

4.10.1 RESEARCH USES AND mTLITlES 

The polynucleotides provided by the presemt invention can be used by the research 
■ community for various purposes. The polynucleotides can be used to express recombinant 

20 protein for analysis, characterization or tiierqjeutic use; as markers for tissues in which the 

corresponding protein is preferentially expressed (either constitutively or at a particular stage of 
tissue differentiation or development or in disease states); as molecular weight markers on gels; 
as chromosome markers or tags (when labeled) to identify chromosomes or to map related gene 
positions; to compare with endogenous DNA sequences in patients to identify potential genetic 

25 disorders; as probes to hybridize and thus discover novel, related DNA sequences; as a source of 
mformation to derive PGR primers for genetic fingerprinting; as a probe to "subtract-out" known 
sequences in the process of discovering other novel polynucleotides; for selecting and making 
oUgomers for attachment to a "gene chip" or other support, including for examination of 
expression patterns; to raise anti-protdn antibodies using DNA immunization techniques; and as 

30 an antigen to raise anti-DNA antibodies or elicit another immune response. Where the 

polynucleotide encodes a protein which binds or potentially binds to another protein (such as, for 
example, in a receptor-ligand interaction), the polynucleotide can also be used m interaction trap 
assays (such as. for example, that described in Gyuris et al.. Cell 75:791-803 (1993)) to identify 
polynucleotides encodmg the other protein with which binding occurs or to identify inhibitors of 

35 the binding interaction. 
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The polypeptides provided by the present inveation can sinularly be used in assays to 
determine biological activity, including in a panel of multiple proteins for high-throughput 
screening; to raise antibodies or to elicit another immune response; as a reagent (mcluding the 
labeled reagent) in assays deagnedto quantitatively determine levels of the protein (or its 

5 receptor) in biological fluids; as markers for tissues in whichihe conesponding polypeptide is 
preferentiaUy expressed (either constitutively or at a particular stage of tissue differentiation or 
development or inadisease state); and, of course, to isolate correlative receptors or ligands. 
Proteins involved in these binding interactions can also be used to screen for peptide or small 
molecule inhibitors or agonists of the bindipg intaaction. 

10 Any or aU of these research utiUties are capable of being developed into reagent grade or 

kit format for commercialization as research products. 

Methods for performing the uses listed above are well known to those skilled in the art 
References disclosing such methods include without limitation "Molecular Cloning: A 
Uboratory Manual", 2d ed., Cold Spring Harbor Laboratory Press, Sambrook, J., E. F. Fritsch 

15 and T. Maniatis eds.. 1989, and "Methods m Enzymology. Guide to Molecular Cloning 
Techniques", Academic Press, Berger, S. L. and A. R. Kimmel eds., 1 987. 

4.105 NUTRITIONAL USES 

Polynucleotides and pdypeptidcs of the present invention can also be used as nutritional 
20 sourcesorsupplements. Such uses mclude without limitation use as a protein or amino add 

supplemerrt.use asacarbon source, use asanilrogensourceanduseasasourceof carbohydrate. In 

suchcasesthe polypeptideorpolynucleotideoffheinvenlioncan be added to the feed of a 
particular organismor can be administeredasaseparatesoUd or Uquidprepaiation,such as inthe 
form of powder.piUs. solutions, suspensionsor capsules. In the case of microorganisms,1he 
25 polypeptideorpolynucleotideoftheinventioncanbeaddedtothemediuminoronwhichtfae 

microoTgaDismis cultuied. 

4.10.3 CYTOKINE AND CELL PROLIFERATION/DIFFERENTUTION 
ACnVTTY 

30 A polypeptide of the present invention may exhibit activity relating to cytokine, cell 

proliferation (either inducing or inhibitmg) or cell differentiation (either inducing or inhibiting) 
activity or may induce production of other cytokines in certain ceU populations. A 
polynucleotide of the mvention can encode a polypeptide exhibiting such attributes. Many 
protein factors discovered to date, mcluding all known cytokines, have exhibited activity in 

35 or more fector-dependent ceU proliferation assays, and hence the assays serve as a convenient 
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confinnation of cytokine activity. The activity of therapeutic compositions of the present 
invention is evidenced by any one of a number of routine factor dependent ceU proliferation 
assays for ceU lines including, without limitation, 32D. DA2. DAIG. TIO. B9, B9/1 1, BaF3. 
MC9/G, M-KpreB M+), 2E8, RB5. DAI. 123. Tl 165, HT2. CTLL2, TF-1, Mo7e, CMK. 
5 HUVEC, and Caco. Therapeutic compositions of the invention can be used in the following: 
Assays for T-cell or thymocyte proliferation include without limitation those described 
in: Current Protocols in Immmiology. Ed by J. E. Coligan, A. M. Kruisbeek, D. H. MarguUes, E. 
M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3, 
In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 7. Immmiologic studies in 
10 Humans); Takai et al., J. Immunol. 137:3494-3500, 1986; BertagnoUi et al., J. Immunol. 

145:1706-1712, 1990; Bertagnolli etal.. Cellular Immunology 133:327-341, 1991; BertagnoUi, 
etal.,L Immunol. 149:3778-3783, 1992; Bowman etal.,LImmunoL 152:1756-1761. 1994. 

Assays for cytokine pnxluction and/or proliferation of spleen ceUs, lyinph node cells or 
thymocytes include, without limitation, those described in: Polyclonal t cell stimulation. 
15 Knrisbeek. A. M. and Shevach. E. M. In Current Protocols in Immunology. J. E. e.a. CoUgan 
eds. Vol 1 pp. 3.12.1-3.12.14, John Wiley and Sons, Toronto. 1994; and Measurement of mouse 
and human interleukin-y, Schreiber, R D. In Current Protocols in Immmiology. J. E. e.a. Coligan 
eds. Vol 1 pp. 6.8.1-6.8.8, John WUey and Sons, Toronto. 1994. 

Assays for proliferation and differentiation of hematopoietic and lymphopoietic cells 
20 include, without limitation, those described in: Measurement of Human and Murine Interleukin 2 
and Interleukin 4. Bottomly, K., Davis, L. S. and Lipsky, P. E. In Current Protocols in 
Immmiology. J. E. e.a. CoUgan eds. Vol 1 pp. 6J.1-6J.12, John Wiley and Sons, Toronto. 1991; 
deVries et al., J. Exp. Med. 173:1205-121 1, 1991; Moreau et al.. Nature 336:690-692. 1988; 
Greenbergcr et al.,Proc. Natl. Acad. Sci. U.S.A. 80:2931-2938, 1983; Measurement of mouse 
25 and human interleukin 6-Nordan, R. In Cmient Protocols in Immunology. J. E. Coligan eds. Vol 
1 pp. 6.6.1-6.6.5, John WUey and Sons. Toronto. 1991; Smith et al., Proc. Natl. Aced. Sci. 
U.S.A. 83:1857-1861, 1986; Measurement of human Interleukin ll-Bennett, P., Giannotti, J., 
Clark. S. C. and Turner. K. J. In Current Protocols in Immunology. J. E. CoUgan eds. Vol 1 pp. 
6.15.1 John WUey and Sons. Toronto. 1991; Measurement of mouse and human Interleukin 
30 9-Ciarletta, A., Giannotti, J., Clark, S. C. and Tumer, K. J. In Current Protocols in Immunology. 
J. E. CoUgan eds. Vol 1 pp. 6.13.1, John Wiley and Sons. Toronto. 1991. 

Assays for T-cell clone responses to antigens (which wiU identify, among others, proteins 
that a&ct APC-T cell interactions as well as direct T-ceU effects by measuring proUferation and 
cytokine production) include, without limitation, those described in: Current Protocols in 
35 Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. M Shevach. W Slrober. 
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Pub. Greene PubUshing Associates and Wiley-Interscience (Chapter 3, In Vitro assays for Mouse 
Lymphocyte Function; Chapter 6, Cytokines and their ceUular receptors; Chapter 7, 
Inununologic studies in Humans); Weinberger et al., Proc. Nafl. Acad. Sd. USA 77:6091-6095, 
1980; Weinberger etal., Eur. J. Immun. 11:405411, 1981;Takaietal., J.Inmiunol. 
5 137:3494-3500, 1986; Takai et al., J. Imraunol. 140:508-512, 1988. 

4.10.4 STEM CELL GROWTH FACTOR ACnVITY 
A polypeptide of the present invention may exhibit stem cdl growth factor activity and 
be involved in the proliferation, difBarentiation and survival of pluripotent and totipotent stem 
10 ccUs including primordial germ cells, embryonic stem cells, hematopoietic stem cells and/or 
germ line stem cells. Administtation of the polypeptide of the invention to stem cells in vivo or 
ex vivo is expected to maintam and expand ceU populations in a totipotential or pluripotential 
state which would be useful for re-engineering damaged or diseased tissues, transplantation, 
manufacture of bio-pharmaceuticals and the development of bio-sensors. The abiUty to produce 
15 large quantities ofhumanceUs has important working applications for the production of human 

proteins which currently must be obtained from non-human sources or donors, implantation of 
ceUs to treat diseases such as Parkinson's, Alzheuner's and other neurodegenerative diseases; 
tissues for grafting such as bone marrow, skin, cartilage, tendons, bone, muscle (including 
cardiac muscle), blood vessels, cornea, neural cells, gastrointestinal cells and others; and organs 

20 for transplantation su«5h as kidney, Uver, pancreas (including islet cells), heart and lung. 

It is contemplated tiiat multiple different exogenous growth factors and/or cytokines may 
be administered in combination with tiie polypeptide of tiie invention to achieve tiie desired 
effect, including any of the growth fectors listed herein, other stem ceU maintenance factors, and 
specifically including stem ceU factor (SCF), leukemia inhibitory factor (LIF), Flt-3 ligand (Flt- 

25 3L), any of the interleukins, recombinant soluble IL-6 receptor fused to IL-6, macrophage 

mflammatory protein 1-alpha (MIP-1 -alpha), G-CSF, GM-CSF, thrombopoietin (TPO), platelet 
factor 4 (PF-4), platelet-derived growth fector (PDGF), neural growth fectors and basic 
fibroblast growth factor (bFGF). 

Since totipotent stem cells can give rise to virtually any mature ceU type, expansion of 

30 tiiese cells in culture will facilitate the production of large quantities of mature cells. Techniques 
for culturing stem'cells are known in tiie art and administration of polypeptides of the invention, 
optionally witii other growth factors and/or cytokines, is expected to enhance the survival and 
proliferation of the stem ceU populations. Hiis can be accomplished by direct administration of 
tiie polypeptide of tiie mvention to tiie culture medium. Alternatively, stroma cells transfected 

35 wifli a polynucleotide tiiat encodes for tiie polypeptide of tiie invention can be used as a feeder 
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layer for the stem ceU populations in culture or in vivo. Stromal support cells for feeder layers 
may include embryonic bone marrow fibroblasts, bone marrow stromal ceUs, fetal Uver cells, or 
cultured embryonic fibroblasts (see U.S, Patent No. 5,690,926). 

Stem ceUs themselves can be transfected with a polynucleotide of the invention to induce 
5 autocrine expression ofthe polypeptide of the invention, nris wiU allow for generation of 
undifferentiated totipotential/pluripotential stem ceU lines that are usefial as is or that can then be 
differentiated into the desired mature ceU types. These stable cell lines can also serve as a source 
of undifferentiated totipotential/pluripotential mKNA to create cDNA Ubraries and templates for 
polymerase chain reaction experiments. These studies would aUow for the isolation and 
10 identification of differentially expressed genes in stem ceU populations that regulate stem cell 
proliferation and/or maintenance. 

Expansion and maintenance of totipotent stem ceU populations wiU be useful in the 
treatment of many pathological conditions. For example, polypeptides of the present invention 
may be used to manipulate stem cells in culture to give rise to neuroepitheUal cells that can be 
15 used to augment or replace cells damaged by iUness, autoimmune disease, accidental damage or 
genetic disorders. The polypeptide ofthe invention mayije useful for inducing the proliferation 
of neural cells and for the regeneration of nerve and brain tissue, i.e. for the treatment of central 
and peripheral nervous system diseases and neuropathies, as weU as mechanical and traumatic 
disorders which involve degeneration, death or trauma to neural cells or nerve tissue. In addition, 
20 the expanded stem cell popidations can also be genetically altered for gene therapy purposes and 
to decrease host rejection of replacement tissues after grafting or implantatioa 

Expression ofthe polypeptide ofthe invention and its effect on stem cells can also be 
manipulated to achieve controlled differentiation of tiie stem cells into more differentiated cell 
types. A broadly appUcable method of obtaining pure populations of a specific differentiated 
25 cell type from undifferentiated stem cell populations involves the use of a cell-type specific 

promoter driving a selectable marker. The selectable marker allows only cells of tiie desired type 
to survive. For example, stem cells can be mduced to differentiate into cardiomyocytes (W obus 
et al., Differentiation, 48: 173-1 82, (1991); Klug et al., J. Clin. Invest., 98(1): 216-224, (1998)) 
or skeletal muscle ceUs (Browder, L. W. In: Principles of Tissue Engineering eds. Lanza et al., 
30 Academic Press (1997)). Alternatively, directed differentiation of stem cells can be 

accomplished by culturing tiie stem ceUs in the presence of a differentiation factor such as 
retinoic acid and an antagonist ofthe polypeptide of tiie invention which would inhibit the 
effects of endogenous stem cell factor activity and allow differentiation to proceed. 

In vitro cultures of stem ceUs can be used to determine if tiie polypeptide of tiie invention 
35 exhibits stem cell growfli factor activity. Stem cells are isolated from any one of various ceU 



42 



wo 01/57188 



PCTAJSOl/03800 



sources (including hematopoietic stem celk and embryonic stem cells) and cultured on a feeder 
layer, as described by Thompson et al. Pioc. NatL Acad. Sci, U.S.A., 92: 7844-7848 (1995), in 
the presence of Ihe polypeptide of the invention alone or in combination with other growth 
fectors or cytokines. The abiUty of the polypeptide of the invention to induce stem cells 
5 proliferation is detennined by colony fonnation on semi-solid support e.g. as described by 
Bernstein et al.. Blood, 77: 2316-2321 (1991). 

410.5 HEMATOPOIESIS REGUIATING ACTIVITY 

A polypeptide of the present invention may be involved in regulation of hematopoiesis 

10 and, consequently, in the treatment of myeloid or lymphoid cell disorders. Even margmal 

biological activity in support of colony forming cells or of fector-dependent cell lines indicates 
involvement in regulating hematopoiesis, e.g. m supporting the growth and proliferation of 
erytfaroid progenitor cells alone or in combination with other cytokmes, thereby mdicating 
utility, for example, in treating various anemias or for use in conjunction with 

15 irradiation/chemotherapy to stimulate the production of erylhroid precursors and/or erythroid 
cells; in supportfag the growth and proliferation of myeloid cells such as granulocytes and 
monocytes/macrophages (i.e.. traditional CSF activity) useful, for example, in conjunction with 
chemotherapy to prevent or treat consequent myelo-suppiession; in supporting the growth and 
proliferation of megakaryocytes and consequently of platelets thereby allowing prevention or 

20 treatment of various platelet disorders such as thrombocytopenia, and generally for use in place 
of or complunentary to platelet Hansfiisions; and/or in supporting the growth and proliferation of 
hematopoietic stem cells which are capable of maturing to any and all of the above-mentioned 
hematopoietic ceUs and therefore find therapeutic utiUty in various stem cell disorders (such as 
those usuaUy Heated with transplantation, including, without limitation, aplastic anemia and 

25 paroxysmal nocturnal hemoglobinuria), as well as in repopulating the stem cell compartment 
post irradiation/chemotherapy, either in-vivo or ex-vivo {i.e., in conjunction with bone marrow 
transplantation or with peripheral progenitor ceU transplantation (homologous or heterologous)) • 
as normal cells or genetically manipulated for gene therapy. 

Therapeutic compositions of the invention can be used in the following: 

30 Suitable assays for proliferation and differentiation of various hematopoietic lines are 

cited above. 

Assays for embryonic stem cell differentiation (which will identify, among others, 
■ proteins that influence embryonic differentiation hematopoiesis) include, without limitation, 
those described in: Johansson et al. Cellular Biology 15:141-151, 1995; Keller et al., Molecular 
35 and CeUulai Biology 13:473-486, 1993; McClanahan et al., Blood 81 :2903-2915, 1993. 
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Assays for stem cell survival and differentiation (wiiich vvill identify, among others, 
proteins that regulate lymphchematopoiesis) include, without limitation, those described in: . 
Methylcellulose colony forming assays, Freshney, M. G. In Culture of Hematopoietic Cells. R. I. 
Fieshney. et d. eds. Vol pp. 265-268. Wiley-Liss. Inc., New Yoik. NX 1994; Hiiayama 

Proc. Nad. Acad. Sci. USA 89:5907-591 1, 1992; Primitive hematopoietic colony forming cells 
ydth high proliferative potential, McNieoe, I. K. and Briddell, R. A. In Culture of Hematopoietic 
Cells. R. I. Freshney. et al. eds. Vol pp. 23-39. WUey-Liss, Inc., New York. N.Y. 1 994; Neben et 
al.. Experimental Hematology 22:353-359. 1 994; Cobblestone area fonning ceU assay, 
Ploemacher, R. E. In Culture of Hematopoietic CeUs. R. I. Freshney, et al. eds. Vol pp. 1-21. 
Wiley-Liss. Inc.. New York, N.Y 1994; Long temi bone marrow cultures in the presence of 
stromal cells. Spooncer. E., Dexter. M. and Allen, T. In Culture of Hematopoietic Cells. R. I. 
Freahney.etal.ed8. Volpp. 163-179. Wiley-Liss. Inc.. New York. N.Y. 1994; Long term culture 
initiating ceU assay, Sutherland. H. J. In Culture of Hematopoietic Cells. R. I. Freshney. et al. 
eds. Vol pp. 139-162. WUey-Liss. hic, New York. N.Y. 1994. 
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4.10.6 TISSUE GROWTH ACTIVITY 

A polypeptide of the present invention also may be involved m bone, cartilage, tendon, 
ligament and/or nerve tissue growth or regeneration, as weU as in wound healing and tissue 
repair and replacement, and in healing of bums, incisions and ulcers. 
20 A polypeptide of the present invention which induces cartUage and/or bone growth in 

circumstances where bone is not normally formed, has appUcalion m the healing of bone 
ftactures and cartUage damage or defects in hum^ and other animals. Compositions of a 
polypeptide, antibody, binding partner, or other modulator of the inventionmay have 

prophylactic use in closed as weU as open fiacture reduction and also in the improved fixation of 
25 artifidal joints. De novo bone formation induced by an osteogenic agent contributes to the 

repair of congenital, trauma induced, or oncologic resection mduced craniofecial defects, and 

also is useful m cosmetic plastic surgery. 

A polypeptide of this invention may also be involved in attracting bone-forming cells. 

stimulating growth of bone-foiming cells, or inducing differentiation of progenitors of 
30 bone-formmg ceUs. Treatment of osteoporosis, osteoarthritis, bone degenerative disorders, or 

periodontal disease, such as through stimulation of bone and/or cartilage repair or by blocking 

inflammation or processes of tissue destruction (coUagenase activity, osteoclast activity, etc.) 

mediated by inflammatory processes may also be possible using the composition of the 

invention. 
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Anolher category of tissue legeneration activily that may involve the polypeptide of the 
present invention is tendonAigament fonnation. Induction of tendonnigament-like tissue or 
other tissue formation in circumstances where such tissue is not normally formed, has 
appHcation in the healing of tendon or ligament tears, deforauties and other tendon or Ugament 

5 defects m humans and other animals. Such a preparation employing atendon/Ugament-like 
tissue inducing protein may have prophylactic use in preventing damage to tendon or Ugament 
tissue, as well as use in the improved fixation of tendon or ligament to bone or other tissues, and 
in repairing defects to tendon or Hgament tissue. De novo tendon/Ugament-like tissue formation 
induced by a composition of the present invention contributes to the repair of congenital, trauma 

10 induced, or other tendon or Ugament defects of other origin, and is also useful in cosmetic plastic 
surgery for attachment or repair of tendons or ligaments. The compositions of the present 
invention may provide environment to attract tendon- or Ugament-forming cells, stimulate 
growth of tendon- or ligament-foiming cells, induce differentiation of progenitors of tendon- or 
ligament-fonning cells, or induce growth of tendon/ligament cells or progenitors ex vivo for 

15 return in vivo to effect tissue repair. The compositions of the invention may also be useful in the 
treatment of tendinitis, carpal tunnel syndrome and other tendon or Ugament defects. The 
compositions may also include an appropriate matrix and/or sequestering agent as a carrier as is 
weU known in the art. 

The compositions of the present invention may also be useful for proliferation of neural 
20 cells and for regeneration of nerve and brain tissue, i.e. for the treatment of central and peripheral 
nervous system diseases and neuropathies, as well as mechanical and traumatic disorders, which 
involve degeneration, death or trauma to neural cells or nerve tissue. More specifically, a 
composition may be used in tiie treatinent of diseases of tiie peripheral nervous system, such as 
peripheral nerve injuries, peripheral neuropatiiy and localized neuropatiues, and central nervous 
25 system diseases, such as Alzheimer's, Parkinson's disease, Huntington's disease. amyotropWc 
lateral sclerosis, and Shy-Diager syndrome. Further conditions which may be treated in 
accordance with the i)resent invention include mechanical and traumatic disorders, such as spinal 
cord disorders, head trauma and cerebrovascular diseases such as stroke. Peripheral neuropathies 
resulting fiom chemotiierapy or other medical therapies may also be treatable using a 

30 conqK>sition of the invention. 

Compositions of tiie invention may also be useful to promote better or faster closure of 
non-heaUng wounds, including witiiout limitation pressure ulcers, ulcers associated witii vascular 
insuflBciency, surgical and tiaumatic wounds, and tiie like. 

Compositions of tiie present invention may also be involved in die generation or 
35 regeneration of otiier tissues, such as organs (including, for example, pancreas, liver, intestine. 
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kidney, skin, endoliidixim). muscle (smooth, skeletal or cardiac) and vascular (including vascular 
endothelium) tissue, or for promoting the growth of cells comprising such tissues. Part of the 
desired effects may be by inhibition or modulation of fihrotic scarring may aUow normal tissue 
to regenerate. A polypeptide of the present invention may also exhibit angiogenic activity. 
5 A composition of the present invention may also be useful for gut protection or 

regeneration and treatment of lung or liver fibrosis, reperfusion injury in various tissues, and 
conditions resiilling from systemic cytokine damage. 

. A composition of the present invention may also be useful for promoting or inhibiting 
differentiation of tissues described above from precursor tissues or cells; or for inhibiting the 
10 growth of tissues described above. 

Therapeutic compositions of the invention can be used in the following: 
Assays for tissue generation activity include, without limitation, those described in: 
International Patent PubUcationNo. WO95/16035 (bone, cartilage, tendon); International Patent 
Publication No. WO95/05846 (nerve, neuronal); International Patent Publication No. 
15 WO91/07491 (skin, endothelium). 

Assays for wound hesding activity irjclude, without limitation, those described in: Winter, 
Epidermal Wound Healing, pps. 71-112 (Maibach, H. I. and Rovee, D. T., eds.). Year Book 
Medical Publishers, Inc., Chicago, as modified by Eaglstein and Mertz, J. Invest Dematol 
71:382-84(1978). 



20 



4.10.7 IMMUNE STIMULATING OR SUPPRESSING ACTIVITY 
A polypeptide of the present invention may also exhibit immune stimulating or immune 
suppressing activity, including without limitation the activities for which assays are described 
herein. A polynucleotide of the invention can encode a polypeptide exhibiting such activities. A 

25 protein may be useful in the treatment of various immune deficiencies and disorders (including 
severe combined immunodeficiency (SCID)), e.g., in regulating (up or down) growth and 
proliferation of T and/or B lymphocytes, as well as effecting the cytolytic activity of NK cells 
and other cell populations. These immune deficiencies may be genetic or be caused by viral (e.g., 
HIV) as weU as bacterial or fimgal infections, or may result firom autoimmune disorders. More 

30 specifically, infectious diseases causes by viral, bacterial, fungal or otiier infection may be 

treatable using a protein of the present invention, including infections by HIV, hepatitis vimses, 
heipes viruses, mycobacteria. Leishinania spp., malaria spp. and various fiangal infections such 
as candidiasis. Of course, in this regard, proteins of the present invention may also be useful 
where a boost to the immune system generally may be desirable, i.e., in the treatinent of cancer. 
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Autoimmune disorders which may be treated using a protein of Ihe present invention 
include, for example, comiective tissue disease, multiple sclerosis, systemic lupus erythematosus, 
rheumatoid arthritis, autoimmune puhnonary inflammation, GuiUain-Barre syndrome, 
autoimmune thyroiditis, insulin dependent diabetes meUitis. myasthenia gravis, graft-versus-host 

5 disease and autoimmune inflammatory eye disease. Such aprotein (or antagonists thereof, 

• including antibodies) of the present invention may also to be useful in the treatment of allergic 
reactions and conditions (e.g.. anaphylaxis, serum sickness, drug reactions, food allergies, msect 
venom allergies, mastocytosis, aUergic rhinitis, hypersensitivity pneumonitis, urticaria, 
angioedema. eczema, atopic dermatitis, aUergic contact dermatitis, erythema multiforme. 

10 Stevens-Johnson syndrome, allergic conjunctivitis, atopic keratoconjunctivitis, venereal 
keratoconjunctivitis, giant papillary conjunctivitis and contact allergies), such as asthma 
(particularly allergic asfluna) or other respiratory problems. Otiier conditions, in which immune 
suppression is desired (including, for example, organ transplantation), may also be treatable 
using a protein (or antagonists thereof) of the present invention. The therapeutic effects of tiie 

1 5 polypeptides or antagonists tiiereof on allergic reactions can be evahiated by in vivo animals 

models such as the cumulative contact aihancement test (Lastbom et al.. Toxicology 125: 59-66, 
1998). skin prick test (Hoffinami et al.. Allergy 54: 446-54. 1999), guinea pig skin sensitization 
test (Vohr et al.. Arch. Toxocol. 73: 501-9), and murine local lymph node assay (Kimber et al., 

■J. Toxicol. Environ. Healtii 53: 563-79). 
20 Using tiie proteins ofthe invention it may also be possible to modulate immune 

responses, in a number of ways. Down regulation may be in the form of inhibiting or blocking an 
immune response already in progress or may involve preventing tiie induction of an immune 
response. Tbs functions of activated T ceUs may be inhibited by suppressing T ceU responses or 
by inducing specific tolerance in T cells, or botii. Immunosuppression of T cell responses is 

25 generally an active, non-antigen-spedfic, process which requires continuous exposure of die T 
cells to die suppressive agent. Tolerance, which involves inducing non-responsiveness or anergy 
in T ceUs. is distinguishable from immunosuppression in tiiat'u is generally antigen-specific and 
persists.after exposure to tiie tolerizing agent has ceased. Operationally, tolerance can be 
demonstrated by tiie lack of a T cell response upon reexposure to specific antigen in tiie absence 

30 of tiie tolerizing agent. 

Down regulating or preventing one or more antigen functions (including witiiout 
limitation B lymphocyte antigen fiinctions (such as. for example, B7)). e.g., preventing high 
level lymphokine syntiiesis by activated T cells, will be useful in sitiiations of tissue, skin and 
organttansplantation and in graft-versus-host disease (GVHD). For example, blockage of T cell 

35 fimction should result in reduced tissue destruction in tissue transplantation. Typically, in tissue 
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transplants, rejectioii of the transplant is initiated tbrough its recognition as foreign by T cells, 
followed by an immune leaction that destroys the transplant Ihe administration of a therapeutic 
composition of the invention may prevent cytokine synthesis by immune cells, such as T ceUs, 
and thus acts as an immunosuppressant. Moreover, a lack of costimulation may also be sufficient 
5 to anergize the T cells, thereby inducing tolerance in a subject. Induction of long-term tolerance 
by B lymphocyte antigen-blocking reagents may avoid the necessity of iq)eated administration 
of these blocking reagents. To achieve sufficient immunosuppression or tolesance in a subject, it 
may also be necessary to block the function of a combination of B lymphocyte antigens. 
The efficacy of particular ther^eutic compositions in preventing organ transplant 
10 rejection or GVHD can be assessed usmg animal models that are predictive of efficacy in 

humans. Examples of appropriate systems which can be used include allogeneic cardiac grafts in 
rats and xenogeneic pancreatic islet ceU grafts in mice, both of which have been used to examine 
the immunosuppressive efifects of CTLA4Ig fusion proteins in vivo as described in Lenschow et 
aL, Science 257:789-792 (1992) and Turka et al., Proc. Natl. Acad. Sci USA, 89:1 1102-1 1 105 
15 (1992). In addition, murine models of GVHD (see Paul ed.. Fundamental Immunology. Raven 
Press, New York, 1989, pp. 846-847) can be used to deteraiine the effect of therapeutic 
compoations of the invention on the developmoat of that disease. 

Blocking antigen function may also be therapeutically useful for treating autoimmune 
diseases. Many autoimmune disorders are the result of inappropriate activation of T ceils that are 
20 reactive against self tissue and which promote the production of cytokines and autoantibodies 
involved in the pathology of the diseases. Preventing the activation of autoreactive T cells may 
reduce or eliminate disease symptoms. Administration of reagents which block stimulation oft 
cells can be used to inhibit T cell activation and prevent production of autoantibodies or T 
cell-derived cytokines which may be involved in the disease process. Additionally, blocking 
25 reagents may induce antigen-specific tolerance of autoreactive T cells which could lead to 

long-term reUef from the disease. The efficacy of blocking reagents in preventing or alleviating 
autoimmune disorders can be determined using a number of well-characterized animal models of 
human autoimmune diseases. Examples include murine experimental autoimmune encephalitis, 
systemic lupus erythmatosis in MRL/lpr/lpr mice or NZB hybrid mice, murine autoimmune 
30 collagen arthritis, diabetes meUitus in NOD mice and BB rats, and murine experimental 

myasthenia gravis (see Paul ed., Fundamental Immunology, Raven Press, New York, 1989, pp. 
840-856). 

Upregulation of an antigen function (e.g., a B lymphocyte antigen fimction), as a means 
of up regulating immune responses, may also be useful in therapy. Upregulation of immune 
35 responses may be in the form of enhancmg an existing immune response or eUciting an initial 
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immune response. For example, enhancing an inunune response may be useful in cases of viral 
infection, including systemic viral diseases sudi as influenza, the common cold, and 
encephalitis. 

Alternatively, anti-viial immune responses may be enhanced in an infiscted patient by 

5 removing TceUsfiom the patient, costimulating the T cells in vitro wth viral antigen-pu^ 
APCs either expressing a peptide of the present invention or together vdth a stimulatory form of 
a soluble peptide of the present invention and reintroducing the in vitro activated T cells into the 
patient. Another method of enhancing anti-viral unmune responses would be to isolate mfected 
cells from a patient, transfect them with a nucleic acid encoding a protem of the present 

10 invention as described herein such that the cells express aU or a portion of the protein on their 
surface, and reintroduce the transfected cells into the patient. The infected cells would now be 
■ capable of deUvering a costimulatory signal to, and thereby activate, T cells in vivo. 

A polypeptide of the present invention may provide the necessary stimulation signal to T 
cells to induce a T ceU mediated immune response against the transfected tumor cells. In 

15 addition, tumor cells whidi lack MHC class I or MHC class H molecules, or which fail to 

reexpress sufBcient mounts of MHC class I or MHC class U molecules, can be transfected with 
nucleic acid encodmg all or a portion of (eg., a cytoplasmic^iomain truncated portion) of an 
• MHC class I alpha chain protem and p2 microglobulin protein or an MHC class H alpha chain 
protein and an MHC class H beta chain protein to thereby express MHC class I or MHC class II 

20 proteins on the cell surface. Expression of the appropriate class I or class H MHC in conjunction 
with a peptide having the activity of a B lymphocyte antigen B7-1, B7-2, B7-3) induces a T 
ceU mediated immune response against the transfected tumor cell. Optionally, a gene encoding 
an antisense construct which blocks expression of an MHC class U associated protein, such as 
the mvariant chain, can also be cotransfected with aDNA encoding apeptide havmg the activity 

25 of a B lymphocyte antigen to promote presentation of tumor associated antigens and induce 
tumor specific hnmunity. Thus, the induction of a T cell mediated immune response in a human 
subject may be sufBciemt to overcome tumor-spedfic tolcaance in the subject 

The activity of a protein of the invention may, among other means, be measured by tiie 
following methods: 

30 Suitable assays for thymocyte or splenocyte cytotoxicity include, without limitation, 

those described in: Current Protocols m Lnmunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. 
H. MarguUes, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and 
Wilcy-Intersdence (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3. 1-3.1 9; 
Chapter 7, Immunologic studies m Humans); Hernnann et al., Proc. Natl. Acad. Sci. USA 

35 78:2488-2492, 1981; Herrmann et al., J. Immunol. 128:1968-1974, 1982; Handa et al., J. 



49 



PCT/USOl/03800 

WO 01/57188 

Immunol. 135:1564-1572, 1985; Takaietal.. I. Immunol. 137:3494-3500. 1986;Takaietal.. J. 
Immunol. 140:508-512. 1988; Bownanet al., J. Virology 61:1992-1998; Bertagnolli et al.. 
CeUular Immunology 133:327-341. 1991; Brown etal., J. ImmunoL 153:3079-3092, 1994. 
Assays for T-cdl-dependent immunoglobulin responses and isotype switching (which 
5 will identify, among others, proteins that modulate T-ceU dependent antibody responses and that 
afect 'n»imi2 proffles) include, without limitation, those described in: Maliszewski, J. 
Immunol. 144:3028-3033, 1990; and Assays for B cell function: In vitro antibody production, 
Mond, J. J. and Brunswick, M. In Current Protocols inlmmunology. J. E. e.a. CoUgan eds. Vol I 
pp. 3.8.1-3.8.16, John Wiley and Sons, Toronto. 1994. 
10 Mixed lymphocyte readion (MLR) assays (which will identify, among others, proteins 

that generate predominantly Thl and CTL responses) include, without limitation, those described 
in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. 
M. Shevach. W. Strober, Pub. Greene PubUshing Associates and Wiley-Interscience (Chapter 3, 
In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 7, Immunologic studies in 
15 Humans); Takai et al., J. ImmunoL 137:3494-3500, 1986; Takai et al., J. Immunol. 140:508-512, 
1988; Bertagnolli et al., J. Immunol. 149:3778-3783, 1992. 

Dendritic cell-dependent assays (which will identify, among others, proteins expressed 
by dendritic cells that activate naive T-cells) include, v«1hout limitation, those described in: 
Query etal., J. Immunol. 134:536-544. 1995; Inaba et al., Journal of Experimental Medicine 
20 173:549-559, 1991; Macatonia et al.. Journal of Immunology 154:5071-5079, 1995; Porgador et 
al.. Journal of Experimental Medicine 182:255-260, 1995; Nair et al., Journal of Virology 
67:4062-4069. 1993; Huang et aL, Science 264:961-965. 1994; Macatonia et al.. Journal of 
Experimental Medicine 169:1255-1264, 1989; Bhardwaj et al.. Journal of Clinical Investigation 
94:797-807, 1994; and Inaba et al.. Journal of Experimental Medicine 172:63 1-640, 1990. 
25 Assays for lymphocyte survival/apoptosis (which will identify, among others, proteins 

that prevent apoptosis after superanligen induction and proteins that regulate lymphocyte 
homeostasis) include, without limitation, those described in: Darzynkiewicz et al., Cytometry 
13:795-808, 1 992; Gorczyca et al.. Leukemia 7:659-670. 1993; Gorczyca et al.. Cancer Reseaich 
53:1945-1951, 1993; Itoh et al., CeU 66:233-243. 1991; Zacharchuk. Journal of Immunology 
30 145:4037-4045, 1990; Zamai et al.. Cytometry 14:891-897, 1993; Gorczyca et al.. International 
Journal of Oncology 1:639-648, 1992. 

Assays for proteins thatinfluence early steps of T-ceU commitment and development 
include, without limitation, those described in: Antica et al., Blood 84:1 11-117, 1994; Fine et al.. 
CeUular Immunology 155:111-122, 1994; Galy etal.. Blood 85:2770-2778, 1995; Toki et al.. 
35 Proc. Nat Acad Sd. USA 88:7548-7551, 1991. 
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4.10.8 ACnvm/INfllBIN ACTIVITY 

A polypeptide of the present invention may also exhilnt activin- or inhibin-related 
activities. A polynucleotide of tiie invention may encode a polypeptide exhibiting such 

5 diaracteristics. InMbms are characterized by their abihty to inMWt the release of fomcle 

stimulating hormone (FSH), whUe activins and are characterized by tiieir ability to stimulate tiie 
release of foUide stimulating hormone (FSH). Thus, apolypeptidc of tiic present invention, 
alone or in heterodimers with a member of the mhibin family, may be useful as a contraceptive 
based on tiie ability of mhibins to decrease fartitity m female mammals and decrease 

10 spermatogenesis in male mammals. Administration of sufScient amounts of other inhibins can 
mduce infertiUty m these mammals. Alternatively, the polypeptide of the invention, as a 
homodimer or as a heterodimer witii otiier protein subunits of the inhibin group, may be useful as 
a futility mducing flierapeutic, based upon the abiUty of activin molecules in stimulating FSH 
release fitom cells of tiie anterior pituitary. See, for example, U.S. Pat. No. 4,798,885. A 

15 polypeptide of tiie invention may also be useful for advancement of tiie onset of fertility in 

sexually immature mammals, so as to mcrease tiie lifistime reproductive performance of domestic 
animals sudi as, but not lunited to, cows, sheep and pigs. 

The activin of a polypeptide of tiie invention may, among otiier means, be measured by 
tiie following methods. 

20 Assays for activin/inhibm activity mdude, witiiout lumtation, tiiose described in: Vale et 

al.. Endocrinology 91 :562-572, 1972; Ling et al.. Nature 321:779-782, 1986; Vale et al., Nabne . 
321:776-779, 1986; Mason et al.. Nature 318:659-663, 1985; Forage et al., Proc. Nati. Acad. Sci. 
USA 83:3091-3095, 1986. 

25 4.10.9 CHEMOTACTIC/CHEMOKINETIC ACnVITY 

A polypeptide of tiie present invention may be involved m chemotactic or chemokmetic 
activity for mammaUan cells, including, for example, monocytes, fibroblasts, neutrophUs, 
T-cells, mast ceUs. eosinophils. epitiieUal and/or endotiielial ceUs. A polynucleotide of tiie 
invention can encode a polypeptide exhibiting such attributes. Chemotactic and chemokinetic 

30 receptor activation can be used to mobilize or attract a desired cell population to a desired site of 
action. Chemotactic or chemokmetic compositions (e.g. proteins, antibodies, binding partners, or 
modulators of tiie invention) provide particular advantages in treatment of wounds and otiier 
trauma to tissues, as weU as m treatment of localized uifections. For example, attraction of 
lymphocytes, monocytes or neutrophils to tumors or sites of infection may result in unproved 

35 immune responses agamst the tumor or mfecting agent 
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A protem or peptide has chemotactic activity for a particular ceU population if it can 
stimulate, directly or indirectly, the directed orientation or movement of such cell population. 
Preferably, the protein or peptide has the ability to directly stimulate directed movement of cells. 
Whether a particular protein has chemotactic activity for a population of cells can be readUy 

5 determined by employing such protdn or peptide in any known assay for ceU chemotaxis. 
Therapeutic compositions of the invention can be used in the foUowing: 
Assays for chemotactic activity (which wiU identify proteins that induce or prevent 
chemotaxis) consist of assays that measure the ability of a protein to induce the migration of 

• cells across a membrane as well as the aWUty of a protein to induce the adhesion of one ceU 

10 population to another cell population. Suitable assays for movement and adhesion include. 

without limitation, those described in: Current Protocols in Immunology, Ed by J. E. Coligan. A. 
M. Kruisbeek, D. H. Marguiles. E. M. Shevach. W. Strober. Pub. Greene Pubhshing Associates 
and Wiley-Interscience (Chapter 6.12. Measurement of alpha and beta Chemokines 
6.12.1-6.12.28; Taub et al. J. Clin. Invest. 95:1370-1376, 1995; Lmd et aL APMIS 103:140-146. 

15 1995; Muller et al Eur. J. Immunol. 25:1744-1748; Gruber et al. J. of Immunol. 152:5860-5867. 
1994; Johnston et al. J. of Immunol. 153:1762-1768. 1994. 

4.10.10 HEMOSTATIC AND THROMBOLyncAcnvrry 

A polypeptide of the invention may also be mvolved in hemostatis or thrombolysis or 
20 thrombosis. A polynucleotide of the invention can encode apolypeptide exhibiting such 

attributes. Compositions may be useful in beatment of various coagulation disorders (including 
hereditary disorders, such as hemophilias) or to enhance coagulation and other hemostatic events 
in treating wounds resulting from trauma, surgery or other causes. A composition of the 
invention may also be useM for dissolving or inhibiting formation of thromboses and for 
25 treatment and prevention ofconditions resulting fl.erefit«n (such as, for example, infers 

cardiac and central nervous system vessels (e.g., stroke). 

nienq)eutic compositions of the invention can be used m flie foUowing: 

Assay for hemostatic and thrombolytic activity include, witixout limitation, tiiose 

described in: Linet et al.. J. Clin. Phannacol. 26:131-140, 1986; Burdick et al., TTirombosis Res. 
30 45:413-419, 1987; Humphrey et al.. Fibrinolysis 5:71-79 (1991); Schaub, Prostaglandins 

35:467-474, 1988. 

4.10.11 CANCER DIAGNOSIS AND THERAPY 

Polypeptides of the invention may be involved in cancer cell generation, proliferation or 
35 metastasis. Detection of tiie presence or amount of polynucleotides or polypeptides of tiie 
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invention may be useful for the diagnosis and/or prognosis of one or more types of cancer. For 
example, the presence or increased expression of a polymicleotide/polypeptide of the invention 
may indicate ahereditary risk of cancer, aprecancerous condition, or an ongoing maUgnancy. 
Conversely, a defect in the gene or absence of the polypeptide may be associated with a cancer 

5 condition. Identification of single nucleotide polymorphisms associated with cancer or a . 
predisposition to cancer may also be use&l for diagnosis or prognosis. 

Cancer treatments promote tumor regression by inhibiting tumor cell proliferation, 
uihibiting angiogenesis (growfti of new blood vessels that is necessary to support tumor growth) 
and/orprohibitingmetastasisbyieducingtumorceUmotilityorinvasiveness. Therapeutic 

1 0 compositions of the invention may be effective in adult and pediatric oncology including in soUd 
phase tumors/maUgnancies, locally advanced tumors, human soft tissue sarcomas, metastatic 
cancer, including lymphatic metastases, blood cell malignancies including multiple myeloma, 
acute and chronic leukemias. and lymphomas, head and neck cancers including mouth cancer, 
larynx cancer and thyroid cancer, lung cancers including small cell carcinoma and non-small cell 

15 cancers, breast cancers including small ceU carcinoma and ductal carcinoma, gastrointestinal 
cancers including esophageal cancer, stomach cancer, colon cancer, colorectal cancer and polyps 
associated with colorectal neoplasia, pancreatic cancers, Uver cancer, urologic cancers including 
bIaddercanccrandprostatecancer.maUgnandesofthefemalegemtaltractindudingovm^ 

carcinoma, uterine (including endometrial) cancers, and solid tumor m the ovarian follicle. 
20 kidney cancers including renal cell carcinoma, brain cancers including intrinsic bram tumors, 
neuroblastoma, astrocytic brain tumors, gliomas, metastatic tumor ceU invasion in the central 
nervous system, bone cancers including osteomas, skin cancers including maUgnant melanoma, 
tumor progression of human skin keratinocytes, squamous ceU cardnoma, basal cell carcinoma, 
hemangiopericytoma and Rarposi's sarcoma. 

25 Polypeptides, polynucleotides, or modulators of polypeptides of the invention (including 

inhibitor and stimulators of the biological activity of the polypeptide of the invention) may be . 
administered to treat cancer. Therapeutic compositions can be administered in therapeutically 
effective dosages alone or in combination with adjuvant cancer therapy such as surgery, 
chemotherapy, radiotherapy, thermotherapy, and laser therapy, and may provide a beneficial 

30 efect. eg. reducing tumor size, slowing rate of tumor growth, inHbiting metastasis, or otherwise 
improving overall clinical condition, without necessarily eradicating the cancer. 

The composition can also be administered in therapeuticaUy effective amounts as a 
portion of an anti-cancer cocktail. An anti-cancer cocktail is a mixture of the polypeptide or 
modulator of tiie invention vnth one or more anti-cancer drugs m addition to a phaimaceutically 

35 acceptable carrier for delivery. The.useofanti-cancer cocktails as a cancer treatment is routine. 
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Anti-cancer drugs that are weU known in the art and can be used as a treatment in combination 
with the polypeptide or modulator of the invention include: Actinomycin D, Aminoghitethimide, 
Asparaginase, Bleomycin, Busulfan, Caiboplatin, Cannustine, Chlorambucil, Cisplatin (cis- 
DDP). Cyclophosphamide, Cytarabine HCl (Cytosine arabinoside), Dacarbazine. Ddctinomycin. 
5 . Daunorubicin HCl, Doxorubicin HCl, Estramustiiie phosphate sodium. Etoposide(V16-213), 
Floxuridine, S^Fluorouracil (5-Fu), Flutamide, Hydroxyurea (hydroxycarbamide). Ifosfamide, 
Interferon Alpha.2a, Interferon Alpha-2b. LeuproUde acetate (LHRH-releasmg fector analog), 
Lomustine, Mechlprdhamine HCl (nitrogen'mustard), Melphalan, Mercaptopurine, Mesna, 
Methotrexate (MJX), Mitomycin, Mitoxantrone HCl. Octreotide, PUcamycin, Procarba2dne HCl, 
10 Streptozocin, Tamoxifen citrate. Tluoguanine. Thiotepa, Vinblastine sulfete. Vincristine sulfate. 
Amsacrine, Azacitidine, Hexamelhylmelamine, Interleukin-2, Mitoguazone, Pentostatin, 
Semustine, Teniposide, and Vindesine sulfete. 

In addition, therapeutic compositions of the mvention may be used for prophylactic 
treatment of cancer. There are hereditary conditions and^or environmental situations (e.g. 
15 exposure to carcinogens) known in the art that predispose an individual to developing cancers. 
Under these circumstances, it may be beneficial to treat these individuals with therapeutically 
effective doses of the polypeptide of the invention to reduce the risk of developing cancers. 

In vitro models can be used to determine the effective doses of the polypeptide of the 
invention as a potential cancer treatment. TTiese in vitro models include proUfcration assays of 
20 cultured tumor celU, growth of cultured tmnor ceUs in soft agar (see Freshney, (1 987) Culture of 
Animal CeUs: A Manual of Basic Technique, WUy-Liss. New York, NY Ch 18 and Ch 21), 
tumor systems in nude mice as described in GiovaneUa et al.. J. Natl. Can. Inst., 52: 921-30 
(1974), mobiUty and invasive potential of tumor cells in Boyden Chamber assays as described in 
Pilkin^on et al.. Anticancer Res., 17: 4107-9 (1997), and angiogenesis assays such as induction 
25 of vascularization of the chick chorioallantoic membrane or induction of vascular endotheUal 
cell migration as described in Ribatta et al., Intl. J. Dev. Biol.. 40: 1 189-97 (1 999) and Li et al., 
Clin. Exp. Metastasis, 17:423-9 (1999), respectively. Suitable tumor ceUs lines are available, 
e.g. fiom American Type Tissue Culture CoUection catalogs. 

30 4.10.12 RECETTOR/LIGAND ACTIVITY 

A polypeptide of the present invention may also demonstrate activity as receptor, 
receptor ligand or inhiUtor or agonist of receptor/ligand interactions. A polynucleotide of the 
invention can encode a polypeptide exhibiting such characteristics. Examples of such receptors 
and ligands include, without limitation, cytokine receptors and their ligands. receptor kinases and 
35 theirligands.receptorphosphatasesandlheirligands,receptorsinvolvedinceU^»Um^^ 
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and their ligands (including without limitation, ceUular adhesion molecules (such as selectins. 
integrins and their ligands) and receptor/Ugand pairs involved in antigen presentation, antigen 
recognition and development of cellular and humoral immune responses. Receptors and Uganda 
are also usefiil for screening of potential peptide or small molecule inhibitors of the relevant 
5 receptor/Ugand interaction. A protein of the present invention (including, without limitation, 
fiagmente of receptors and ligands) may themselves be useful as inhibitore of receptor/ligand 
interactions. 

TTie activity of a polypeptide of the invention may, among other means, be measured by 
the following methods: 

10 Suitable assays for receptor-Ugand activity include without limitation those described in: 

Current Protocols in Immunology, Ed by J. E. CoUgan, A. M. Kruisbeek, D. H. Margdies. E. M. • 
Shevach, W. Strober, Pub. Greene Publishmg Associates and Wiley- Interscience (Chapter 728, 
Measurement of Cellular Adhesion under static conditions 7.28.1- 7.28.22), Takai et al.. Proc. 
Natl. Acad. Sd. USA 84:6864-6868, 1987; Bierer et al., J. Exp. Med. 168:1 145-1 156, 1988; 

15 Rosenstein et al., J. Exp. Med. 169:149-160 1989; Stohenborg et al., J. Immunol. Methods 
175:59-68, 1994; Stittetal., Cell 80:661-670, 1995. 

By way of example, the polypeptides of the invention may be used as a receptor for a 
ligand(s) thereby transmittmg the biological activity of that ligand(s). Ugands may be identified 
through Knding assays, affinity chromatography, dihybrid screening assays, BIAcore assays, gel 

20 overlay assays, or other methods known in the art 

Studies characterizing drugs or proteins as agonist or antagonist or partial agonists or a 
partial antagonist require the use of other proteins as competing Ugands. The polypeptides of the 
present invention or ligand(s) thereof may be labeled by being coupled to radioisotopes, 
colorimetric molecules or a toxin molecules by conventional methods. ("Guide to Protein 

25 Purification" Murray P. Deutscher (ed) Methods in En2ymology Vol. 182 (1990) Academic 
Press, Inc. San Diego). Examples of radioisotopes include, but are not limited to. tritium and 
carbon-14 . Examples of colorimetric molecules mclude, but are not limited to, fluorescent 
molecules such as fluorescamine, or rhodamine or other colorimetric molecules. Examples of 
toxins include, but are not limited, to ricin. 

30 

4.10.13 DRUG SCREENING 

This invention is particularly usefiU for screening chemical compounds by using the 
novel polypeptides or binding fiagments thereof in any of a variety of drug screening techniques, 
nie polypeptides or fragments employed in such atest may either be free in solution, affixed to a 
35 soUdsupport,bomeonacellsurfaceorlocatedintraceUularly. One method of drug screening 
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iilUizes eukaryotic or prokaiyotic host cells which are stably transformed with recombinant 
nucleic acids expressingihe polypeptide or a fragment thereo£ Drugs are screened against such 
transformed cells in competitive binding assays. Such cells, either in viable or fixed fonn, can 
be used for standard binding assays. One may measure, for example, the fomiation of 
5 complexes between polypeptides ofthe invention or fragments and the agem being tested 0^ 

examine the diminution in complex formation between the novel polypeptides and an 
appropriate cell line, whidi arc well known in the art. 

Sources for test compounds that may be screened for abiUty to bind to or modulate {Le., 
incrcase or decrease) the activity of polypeptides of the invention include (1) inorganic and 
10 organic chemical libraries, (2) natural Fodnct libraries, and (3) combinatorial libraries 
comprised of either random or mimetic peptides, oligonucleotides or organic molecules. 

Chemical libraries may be readUy synthesized or purchased from a number of 
commercial sources, and may include structural analogs of known compounds or compomids 

that are identified as "hits" or "leads" via natural product screening. 
15 The sources of natural product Ubraries are microorganisms (including bacteria and 

ftngi), animals, plants or other vegetation, or marine organisms, and Ubraries of mfartures for 
screening may be created by: (1) femientation and extraction of broths fit>m soil, plant or marine 
miciooiganisms or (2) extraction of the organisms themselves. Natural product libraries include 
polyketides, non-ribosomal peptides, and (non-naturally occurring) variants thereof. For a 

20 review, see Science 252:63-68 (1998). 

Combinatorial Hbraries are composed of large numbers of peptides, oligonucleotides or 
organic compomids and can be readily prepared by traditional automated synthesis methods, 
PGR, cloning or proprietary synthetic methods. Of particular interest are peptide and 
oligonucleotide combinatorial Ubraries. StiU other Ubraries of interest include peptide, protein, 

25 peptidomimetic, multiparallel synthetic collection, recombinatorial. and polypeptide Ubraries. 
For areview of combinatorial chemistry and libraries created therefrom, see Myers, Curr. Opiru 
Biotechnol. 8:701-707 (1997). For reviews and examples of peptidomimetic libraries, see 
Al-Obeidi et al., Mol Biotechnol, 9(3):205-23 (1998); Hruby et al.. Curr Opin Chem Biol, 
1(1):114-19 (1997); Domer et al., Bioorg Med Chem, 4(5):709-15 (1996) (alkylated dipeptides). 

30 Identification of modulators through use of the various libraries described herein permits 

modification of the candidate ♦'hit" (or 'lead") to optimize the cq)acity of the "hit" to bind a 
polypeptide of the invention. Tbe molecules identified in the binding assay are then tested for 
antagonist or agonist activity in in vivo tissue culture or animal models that are weU known in the 
art. Inbrief,themoleculesaretitratedintoapluraUtyofcenculturesoranimalsandthentested 

35 for either cell/animal death or prolonged survival of the animal/ceUs. 
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Tte binding molecules thus identified may be complexed with toxins, e.g., licin or 
cholera, or with other compounds that are toxic to cells such as radioisotopes. The toxin-binding 
molecule complex is then targeted to a tumor or other cell by the spedficily of the binding 
molecule for a polypeptide of the invention. Alternatively, the binding molecules may be 
5 complexed with imaging agents for targeting and imaging purposes. 

4.10.14 ASSAY FOR RECEPTOR ACnVlTY 

The invention also provides mefliods to detect specific binding of a polypeptide e.g. a 
Ugand or a receptor. The art provides numerous assays particularly usefiil for identifying 

10 previously unknown binding partners for receptor polypeptides of the invention For example, 
expression cloning using mammaHan or bacterial cells, or dihybrid screening assays can be used 
to idaitify polynucleotides cncotHng binding partners. As anotiier example, affinity 
chcomatographyjwith the appropriate immobilized polypeptide of the invention can be used to 
isolate polypeptides tiiat recogniTe and bind polypeptides of tiie invention. There are a number 

15 of different libraries used for tiie identification of compounds, and in particular small molecules, 
that modulate (ie.. increase or decrease) biological activity of a polypeptide of tiie invention. 
Ligands for receptor polypeptides of flie invention can also be identified by adding exogenous 
Uganda, or cocktails of Ugands to two cells populations tiiat are genetically identical except for 
tiic expression of tiie receptor of tiie invention: one cell population expresses the receptor of tiie 

20 invention whereas tiie otiier does not. The response of the two cell populations to the addition of 
ligands(s) are tiien compared. Alternatively, an expression library can be co-expressed witii tiie 
polypeptide of flie invention in ceUs and assayed for an autocrine response to identify potential 
ligand(s). As still anotiier example, BIAcore assays, gel overlay assays, or otiier metiiods known 
in tiie art can be used to identify binding partner polypeptides, including. (1) organic and 

25 inorganic chemical Ubraries, (2) natural product libraries, and (3) combinatorial Ubraries 
comprised of random peptides, oligonucleotides or organic molecules. 

The role of downstream intracellular signaling molecules in tiie signaling cascade of tiie 
polypeptide of tiie invention can be determined. For example, a chimeric protem in which tiie 
cytoplasmic domam of tiie polypeptide of tiie invention is fused to tiie extracellular portion of a 

30 protein, whose ligand has been identified, is produced in a host ceU. The cell is tiien mcubated 
wifli tiie ligand specific for tiie extraceUular portion of tiie chimeric protein, tiiereby activating 
tiie chimeric receptor. Known downstream proteins involved in intraceUular signaling can tiien 
be assayed for expected modifications U phosphorylation: Otiier mefliods known to tiiose in tiie 
art can also be used to identify signaling molecules involved in receptor activity. 

35 
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4.10.15 Arm-BSFIJWVlMATORYACTIVrrY 

Compositions ofihe present invention may also exhibit anti-inflammatory activity. The 
anti-infiammatoiy activity may be achieved by providing a stimulus to cells involved in the 
inflammatory response, by inhibiting or promoting ceU-cell interactions (such as. for example, 

5 cell adhesion), by inhibiting or promoting chemotaxis of cells involved in the inflammatory 
process, inhibiting or promoting cell exiravasation, or by stimulating or suppressing production 
of other factors which more directly inhibit or promote an inflammatory response. Compositions 
witii such activities can be used to treat inflammatory conditions including chronic or acute 
conditions), including without limitation intimation associated with infection (such as septic 

10 shock, sepsis or systemic inflammatory response syndrome (SIRS)), ischemia-reperfusion injury, 
endotoxin lethality, arthritis, complement-mediated hyperacute rqection, nephritis, cytokine or 
chemoldne-bduced lung iiyury, inflammatory bowel disease. Crohn's disease or result 
over production of cytokines such as TOT or IL-l . Compositions of the invention may also be 
useful to treat anaphylaxis and hypersensitivity to an antigenic substance or material. 

15 Compositions of this invention may be utilized to prevent or treat conditions such as. but not 
limited to, sepsis, acute pancreatitis, endotoxin shock, cytokine induced shock, rheumatoid 
arthritis, chronic inflammatory arthritis, pancreatic ceU damage ftom diabetes melUtus type 1, 
graft versus host disease, inflammatory bowel disease, inflamation associated with puhnonaiy 
disease, other autoimmune disease or inflammatory disease, an antiproliferative agent such as for 

20 acute or chronic mylegenous leukemia or in the prevention of premature labor secondary to 
intrauterine infections. 

4.10.16 LEUKEMIAS 

Leukemias and related disorders may be treated or prevented by administration of a 
25 therapeutic that promotes or inhibits function of the polynucleotides and/or polypeptides of the 
invention. Such leukemias and related disorders mclude but are not limited to acute leukemia, 
acute lymphocytic leukemia, acute myelocytic leukemia, myeloblastic, promyelocytic, 
myelomonocytic. monocytic, erythroleukemia, chronic leukemia, chronic myelocytic 
(granulocytic) leukemia and chronic lymphocytic leukemia (for a review of such disorders, see 
Fishman et al.. 1985. Medicine, 2d Ed., J.B. Lippincott Co., Philadelphia). 



30 



4.10.17 NERVOUS SYSTEM DISORDERS 

Nervous system disorders, involving cell types which can be tested for efBcacy of 
intervention with compounds that modulate the activity of the polynucleotides and/or 
35 polypeptides of the invention, and which can be treated upon thus observing an indication of 
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4erapeutic iitility, include but are not limited to nervous system injuries, and diseases or 
disorders wHch result in eitheradiscomiertion of axons,adiminution or degener^ 

neurons, or demyelination. Nervous system lesions which may be treated in a patient (including 
human and non-human mammalian patients) according to the invention include but are not 

5 liinited to the following lesions of either the central Cmduding spinal cord, brain) or peripheral 

nervous systrans: 

(i) traumaticlesions,includinglesionscausedbyphysicalinjuryorassociated^vith 
surgery, for example, lesions which sev^r a portion of the nervous system, or compression 
injuries; 

10 (ii) ischemiclesions,mwhichalackofoxygeninaportionofthenervoussystem 

results in neuronal injury or death, including cerebral infarction or ischemia, or spinal cord 

infarction or ischemia; 

(in) infectious lesions, in v»*ichaportionofthe nervous system is destroyed or 

injured as a result of infection, for example, by an abscess or associated with infection by human 
15 immunodeficiency virus, herpes zoster, or herpes simplex virus or with Lyme disease, 

tuberculosis, syphilis; 

(iv) degenerative lesions, in y^iich a portion .ofthe nervous system is destroyed or 

injured as aresult of a degenerative process including but not limited to degeneration associated 
yaih Parkinson's disease, Alzheimer's disease, Huntington's chorea, or amyotrophic lateral 
20 sclerosis; 

(V) lesions associated with nutritional diseases or disorders, in which a portion of die 
nervous system is destroyed or injured by a nutritional disorder or disorder of metaboUsm 
including butnot limited to, vitamin B12 deficiency, foUc acid deficiency, Wernicke disease, 
tobacco-alcohol amblyopia. Marchiafava-Bignami disease (primary degeneration of tiie corpus 
25 callosum), and alcoholic cerebellar degeneration; 

(vi) neurological lesions associated wifli systemic diseases including but not limited to 
diabetes (diabetic neuropatiiy. Bell's palsy), systemic lupus eryflxematosus. carcinoma, or 
sarcoidosis; 

(vii) lesions caused by toxic substances including alcohol, lead, or particular 

30 neurotoxins; and 

(viu) demyelinated lesions in which a portion oftiie nervous system is destroyed or 

injured by a demyelinating disease including but not hmited to multiple sclerosis, human 
immunodeficiency virus-associated myelopatiiy. transverse myelopatiiy or various etiologies, 
progressive multifocal leukoencephalopathy, and central pontine myelinolysis. 
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Therapeutics which are usefol according to the invention for treatment of a nervous 
system disorder may be selected by testing for biological activity in promoting the survival or 
differentiation of neurons. For example, and not by way of limitation, therapeutics which elicit 
any of the foUowing effects may be useful according to the invention: 
5 (i) increased survival time of neurons in culture; 

(ii) increased sprouting of neurons in culture or in vivo; 

(iii) increased production of a neuron-associated molecule in culture or in vivo, e.g., 
choline acetyltransferase or ac^lcho^nesterase with respect to motor neurons; or 

(iv) decreased syn^tomsofneuron dysfunction in vivo. 

10 Sucheffectsmaybemeasuredbyanymethodknownintheart. Inpreferred, 

non-limiting embodiments, increased survival of neurons may be measured by the method set 
forfli in Arakawa et al. (1990, J. Neurosci. 10:3507-3515); increased sprouting of neurons may 
be detected by methods set forth in Pestronk et al. (1980, Exp. Neurol. 70:65-82) or Brown et al. 
(1981, Ann. Rev. Neurosci. 4:17-42); increased production of neuron-associated molecules may 

15 be measured by bioassay. enzymatic assay, antibody binding. Northern blot assay, etc., 

depending on the molecule to be measured; and motor neuron dysfunction may be measured by 
assessing the physical manifestation of motor neitron disorder, e.g., weakness, motor neuron 
conduction velocity, or functional disability. 

In specific embodiments, motor neuron disorders that may be treated according to the 

20 invention include but are not limited to disorders such as inferction, infection, exposure to toxin, 
trauma, surgical damage, degenerative disease or malignancy that may afiect motor neurons as • 
well as oflier components of the nervous system, as well as disorders that selectively affect 
neurons such as amyotrophic lateral sclerosis, and includmg but not limited to progressive spinal 
muscular atrophy, progressive bulbar palsy, primary lateral sclerosis, infantile and juvenile 

25 muscular atrophy, progressive bulbar paralysis of childhood (Fazio-Londe syndrome), 
poUomyeUtis and die post poUo syndrome, and Hereditary Motorsensory Neuropathy 
(CJharcot-Marie-Tootii Disease). 

4.10.18 OTHER ACnvmES 

30 A polypeptide of the invention may also exhibit one or more of the following additional 

activities or effects: inhibiting the growtii, infection or function of, or killing, infectious agents, 
including, without limitation, bacteria, viruses, fungi and other parasites; effecting (suppressing 
or enhancing) bodily characteristics, including, without limitation, height, weight, hair color. eye 
color, skin, fet to lean ratio or other tissue pigmentation, or organ or body part size or shape 

35 (such as, for example, breast augmentation or diminution, change in bone form or shape); 
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effecting biorhytiims or circadian cycles or rhythms; effecting the fertiUty of male or female 
subjects; effecting the metabolism, catabolism, anabolism, processing, utilization, storage or 
elimination of dietary fet, lipid, protein, carbohydrate, vitamins, minerals, co-fectors or other 
nutritional fectors or component(s); effecting behavioral characteristics, including, without 

5 limitation, appetite, Ubido, stress, cognition (including cognitive disorders), depression 

(including depressive disorders) and violent behaviors; providing analgesic effects or other pain 
reducing effects; promoting differentiation and growth of embryonic stem cells in lineages other 
than hematopoietic Imeages; hormonal or endocrine activity; in the case of enzymes, correcting 
deficiencies of the enzyme and treating deficiency-related diseases; treatment of 

10 hypeiproliferative disorders (such as, for example, psoriasis); unmunoglobulm4ike activity (such 
as, for example, the ability to bmd antigens or complement); and the abiUly to act as an antigen 
in a vaccme composition to raise an immune response against such protdn or another material or 
entity which is cross-reactive vrifli such protein. 

15 4.10.19 IDENTIFICATION OF POLYMORPfflSMS 

The demonstration of polymorphisms makes possible the identification of such 
polymorphisms m human subjects and the phannacogenetic use of this information for diagnosis 
and treatment. Such polymorphisms may be associated with. e.g., differential predisposition or 
susceptibiUty to various disease states (such as disorders involving mflammation or immune 
20 response) or a differential ieq)onse to drug administration, and this genetic infonnation can 

used to tailor preventive or therapeutic treatment appropriately. For example, the existence of a 
polymorphism associated with a predisposition to inflammation or autoimmune disease makes 
possible the diagnosis of this condition in humans by identifying tiie presence of the 
polymorphism. 

25 Polymorphisms can be identified in a variety of ways known in the art which all 

generaUy involve obtaining a sample fix)m a patient, analyzmg DNA fiom the sample, optionally 
involving isolation or amplification of the DNA, and identifying the presence of the 
polymorphism m the DNA. For example, PGR may be used to amplify an appropriate fragment 
of genomic DNA which may then be sequenced. Alternatively, the DNA may be subjected to 

30 allele-specific oUgonucleotide hybridization (in which appropriate oligonucleotides are 

hybridized to the DNA under conditions peraiitting detection of a single base mismatch) or to a 
single nucleotide extension assay (in which an oUgonucleotide that hybridizes immediatety 
adjacent to the position of the polymorphism is extended with one or more labeled nucleotides). 
In addition, traditional restriction fiagment length polymoiphism analysis (using restriction 

35 enzymes that provide differential digestion of the genomic DNA depending on the presence or 
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absence of the polymoiphism) may be perfoimed. Arrays willi nucleotide sequences of the 
present mvention can be used to detect polymorphisnis. The array can comprise modified 
nucleotide sequences of the present invention in order to detect the nucleotide sequences of &e 
present invention. In the altemadve, any one of the nucleotide sequences of the present 
5 invention can be placed on tiie array to detect changes from those sequences. 

Alternatively a polymorphism resulting in a change in the amino acid sequence could 
also be detected by detecting a corresponding change in amino acid sequence of the protdn, e.g., 
by an antibody specific to tin variant sequence. 

LO 4.10J0 ARTHRITIS AND DIFLAMMATION 

The immunosuppressive effects of the compositions of the invention against rheumatoid 
artinitis is determined in an experimental animal model system. The experimental model system 
is adjuvant induced arthritis in rats, and die protocol is described by J. Holoshitz, et at, 1983, 
Science. 219:56, or by B. Waksman et al., 1963, Int. Arch. Allergy Appl. Immunol.. 23:129. 

15 Induction of tiie disease can be caused by a single injection, generally intradennally, of a 
suspension of killed Mycobaaerium tuberculosis in complete Freund's adjuvant (CFA). "Ilie 
route of injection can vary, but rats may be injected at the base of die tail witii an adjuvant 
mixture. The polypeptide is administered in phosphate buffered solution (PBS) at a dose of about 
1-5 mg/kg. The control consists of administering PBS only. 

20 The procedure for testing tiie effects of the test compound would consist of intradermally 

injecting kiUed Mycobacterium tuberculosis in CFA followed by immediately administering the 
test compound and subsequent treatment every other day until day 24. At 14, 15, 18, 20, 22, and 
24 days after injection of Mycobacterium CFA, an overall arthritis score may be obtained as 
described by J. Holoskitz above. An analysis of tiie data would reveal that die test compound 

25 would have a dramadc affect on the swelling of tiie joints as measured by a decrease of tiie 
artiuttis score. 

4.11 THERAPEUTIC METHODS 

The compositions (including polypeptide fragments, analogs, variants and antibodies or 
30 otiier binding partners or modulators including antisense polynucleotides) of die invention have 
numerous appUcations in a variety of tiierapeutic metiiods. Examples of therapeutic appUcations 
include, but are not limited to, those exanplified herein. 

4.11.1 EXAMPLE 
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One embodiment of the invention is the administration of an effective amount of the 
polypeptides or other composition of the invention to individuals afifected by a disease or 
disorder that can be modulated by regulating the peptides of the invention. While the mode of 
adnunistration is not particularly important, parenteral administration is prefened. An 

5 exemplary mode of administration is to. deliver an intravenous bolus. The dosage of the 
polypeptides or other composition of the invention will normally be determined by the 
prescribing physician. • It is to be expected that the dosage will vary accordmg to the age, weight, 
condition and response of the individual patient Typically, the amount of polypeptide 
administered per dose will be in the range of about 0.01 ^g/kg to 100 mg/kg of body weight, with 

10 the preferred dose being about O.ljig/kg to 10 mg/kg of patient body weight. For parenteral 
administration, polypeptides of the invention will be formulated in an mjectable form combined 
with a pharmaceutically acceptable parenteral vehicle. Such vehicles are well known in the art 
and examples include water, salme, Ringer's solution, dextrose solution, and solutions consisting 
of small amounts of the human serum albumin. The vehicle may contain minor amounts of 

15 additives that maintain the isotonicity and stability of the polypeptide or other active ingrecUent 
The preparation of such solutions is within the skill of the art. 

4.12 PHARMACEUTICAL FORMULATIONS AND ROUTES OF 
ADMINISTRATION 

20 A protein or other composition of the present invention (from whatever source derived, 

including without limitation from recombinant and non-recombinant sources and including 
antibodies and other binding partners of the polypeptides of the invention) may be administered 
to a patient in need, by itself or in pharmaceutical compositions where it is mixed with suitable 
carriers or excipient(s) at doses to treat or ameliorate a variety of disorders. Such a composition 

25 may optionally contain (in addition to protein or other active ingredient and a carrier) diluents, 
fillers, salts, buffers, stabilizers, solubilizers, and other materials well known in the art. The term 
"phannaceutically acceptable" means a non-toxic material that does not interfere with the 
effectiveness of the biological activity of the active ingredient(s). The characteristics of the 
earner will depend on the route of administration. The pharmaceutical composition of the 

30 invention may also contain cytokines, lymphokines, or otiier hematopoietic factors such as 

M-CSF, GM-CSF, TNF, IL-I, IL.2, IL-3, IL-4, IL-S, IL-6, IL-7, IL-8, IL-9, IL-10, IL-l 1, IL-12, 
E..13, IL-14, IL-15, IFN, TNFO, TNFl, TNF2, G-CSF, Meg-CSF, thrombopoietin, stem cell 
factor, and erythropoietin. In further compositions, proteins of the invention may be combined 
with other agents beneficial to the treatment of tiie disease or disorder in question. These agents 

35 include various growth fectors such as epidennal growth factor (EGF), platelet-derived growth 
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factor (PDGF), transfonning growth fectois CTGF-a and TGF-p), insulin-like growth factor 
(IGF), as well as cytokines described herein. 

The pharmaceutical composition may further contain other agents which eilher enhance 
the activity of the protein or other active ingredient or complement its activity or use in 
5 treatment. Such additional fectors and/or agents may be included in the pharmaceutical 
composition to produce a synergistic effect with protein or other active ingredient of the 
invention, or to minimi2e side effects. Conversely, protein or other active ingredient of the 
present invention may be included in formulations of the particular dotting factor, cytokine, 
lymphokine, other hematopoietic factor, thrombolytic or and-thrombotic fiactor. or aoti- 
10 mflammatory agent to minimize side effects of the clotting fector, cytokine, lymphokine. other 
hematopoietic factor, thrombolytic or anti-thrombotic factor, or anti-mflammatory agent (such as 
IL^Ra, IL-1 Hyl, IL-1 Hy2. anti-TNF, corticostraoids, immunosuppressive agents). A protem 
of the present invention may be active in multimers (e.g., heterodimers or homodimers) or 
complexes with itself or other FOteins. As a result, pharmaceutical compositions of the 
15 invention my comprise a protein ofthe invention in such multimeric or complexed forni. 

As an alternative to being mcluded in a pharmaceutical composition of the invention 
including a first protein, a second protein or a therapeutic agent may be concurrently 
administered with the first protein (e.g. , at the same time, or at differing times provided that 
therapeutic concentrations of the combination of agents is achieved at the treatment site). 
20 Techniques for formulation and administration of the compounds of the instant appUcation may 
be found m "Remington's Pharmaceutical Sciences," Mack Publishing Co., Easton, PA, latest 
edition. A therapeutically effective dose further refers to that amount of the compound sufficient 
to result in amelioration of symptoms, e.g., treatment, healing, prevention or ameUoration of the 
relevant medical condition, or an increase in rate of treatment, healing, prevention or 
25 ameUoration of such conditions. When appUed to an individual active ingredient, administered 
alone, a therapeuticaUy effective dose refers to that ingredient alone. When appUed to a 
combination, a therapeutically effective dose refers to combined amounts of the active 
ingredients that result in the therapeutic effect, whether administered in combination, serially or 
simultaneously. 

In practicing the method of treatment or use of the present invention, a therapeutically 
effective amount of protem or other active ingredient of the present invention is administered to 
a mammal having a condition to be treated. Protein or other active ingredient of the present 
invention may be administered in accordance with the method of the invention either alone or in 
combinadon with other therapies such as treatments employmg cytokines, lymphokines or other 
35 hematopoietic fiictors. when co- administered with one or more cytokines, lymphokines or other 
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hematopoietic factors, protein or other active ingredient of the present invention may be 
administered either simnltaneoTjsly with the cytokine(s), lymphokine(s), other hematopoietic 
factor(s), thrombolytic or anti-thrombotic fectors, or sequentially. If administered sequentially, 
the attending physician will dedde on the appropriate sequence of administering protein or other 
5 active ingredient of the present invention in combination with cytokine(s), lymphokine(s), other 
hematopoietic factor(s), thrombolytic or anti-thrombotic fectras. 

4.12.1 ROUTES OF ADMINISTRATION 

Suitable routes of administration may, for example, include oral, rectal, transmucosal, or 
10 intestinal administration; parenteral delivery, including intramuscular, subcutaneous, 
intrameduUary injections, as well as intrathecal, direct intraventricular, intravenous, 
intraperitoneal, intranasal, or intraocular injections. Administration of protdn or other active 
ingredient of the present invention used in the pharmaceutical composition or to practice the 
method of the present mvention can be carried out in a variety of conventional ways, such as oral 
15 ingestion, inhalation, topical t^Ucation or cutaneous, subcutaneous, intraperitoneal, parenteral 
or intravenous injection. Intravenous administration to the patient is preferred. 

Alternately, one may administer tiie compound in a local rather than systemic manner, for 
example, via injection of the compound direcfly into a arthritic joints or in fibrotic tissue, often 
in a depot or sustained release formulation. In onler to prevent the scarring process frequently 
20 occurring as compUcation of glaucoma surgery, the compounds may be administered topicaUy, 
-for example, as eye drops. Furthermore, one may administer the drug in a targeted drug deUvery 
system, for example, in a liposome coated with a specific antibody, targeting, for example, 
ithritic or fibrotic tissue. The liposomes wiU be targeted to and taken up selectively by the 
afflicted tissue. 

25 The polypeptides of the invention are administered by any route tiiat delivers an effective 

dosage to the desired site of action. The determination of a suitable route of administration and 
an effective dosage for a particular indication is within the level of skill in the art. Preferably for 
wound treatment, one administers tiie therapeutic compound directly to the site. Suitable dosage 
ranges for the polypeptides of tiie invention can be extrapolated from these dosages or from 

30 similar studies in appropriate animal models. Dosages can tiien be adjusted as necessary by the 
clinician to provide maximal ther^utic benefit. 

4.12.2 COMPOSITIONS/FORMULATIONS 

Pharmaceutical compositions for use in accordance with the present invention thus may 
35 be formulated in a conventional manner using one or more i^ysiologically acceptable carriers 
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comprising excipients and auxiliaries which facilitate processing of the active compounds into 
preparations which can be used phaimaceuticaily. These pharmaceutical compositions may be 
manufectured in a manner that is itself known, e,g., by means of conventional mixing, 
dissolving, granulating, dragee-making, levigating, eraulsifymg, enc^suladng, entrapping or 

5 lyophilizmg processes. Proper fonnulation is dependent upon the route of administration 
chosen. When a flierapeutically effective amount of protein or otiier active ingredient of the 
present invention is administered orally, protein or other active ingredient of the present 
invention will be in flie form of a tablet, capsule, powder, solution or elixir. When administered 
in tablet form, the pharmaceutical composition of the invention may additionally contain a solid 

10 carrier such as a gelatin or an adjuvant. The tablet, capsule, and powder contain from about 5 to 
95% protein or other active ingredient of the present invention, and preferably from about 25 to 
90% protein or other active ingredient of the present invention. When administered in liquid 
fonn, a liquid carrier such as water, petroleum, oils of animal or plant origin such as peanut oil, 
mineral oil, soybean oil, or sesame oil, or synthetic oils may be added. The liquid fomi of the 

1 5 pharmaceutical composition may fiirflier contain physiological saline solution, dextrose or otiier 
saccharide solution, or glycols such as ethylene glycol, propylene glycol or polyetiiylene glycol. 
When admmistered in liquid form, the phannaceutical composition contains from about 0.5 to 
90% by weight of protein or other active ingredient of the present invention, and preferably from 
about 1 to 50% protein or other active ingredient of the present invention. 

20 When a therapeutically effective amount of protein or other active mgredient of the 

present invention is administered by intravenous, cutaneous or subcutaneous injection, protein or 
other active ingredient of tiie present invention will be in the form of a pyrogen-free, parenterally 
acceptable aqueous solution. The preparation of such parenterally acceptable protein or other 
active ingredient solutions, having due regard to pH, isotonicity, stability, and the like, is within 

25 the skill in tiie art. A preferred pharmaceutical composition for intravenous, cutaneous, or 
subcutaneous injection should contain, in addition to protein or other active ingredient of the 
present invention, an isotonic vehicle such as Sodium Chloride Injection, Ringer's Injection, 
Dextrose Injection, Dextrose and Sodium Chloride Injection, Lactated Ringer's Injection, or 
other vehicle as known in the art. The pharmaceutical composition of the present invention may 

30 also contain stabilizers, preservatives, buffers, antioxidants, or other additives known to those of 
skill in the art. For injection, the agents of the invention may be formulated in aqueous 
solutions, preferably in physiologically compatible buffers such as Hanks's solution, Ringer's 
solution, or physiological saline buffer. For transmucosal administration, penetrants appropriate 
to tiie bairier to be permeated are used in flie formulation. Such penetrants are generally known 

35 in tiie art. 
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For oral administration, the compounds can be formulated readily by combining the 
active compounds with phannaceutically acceptable cairiers well known inlhe art. Such earners 
enable the compounds of the invention to be formulated as tablets, pills, dragees, capsules, 
liquids, gels, syrups, slurries, suspensions and the like, for oral ingestion by a patient to be 

5 treated. Pharmaceutical preparations for oral use can be obtained ftom a soUd excipient, 
optionally grindmg a resulting mixture, and processing the mixture of granules, after adding 
suitable auxiUaries, if desired, to obtain tablets or dragee cores. Suitable excipients are, in 
particular, fillers such as sugars, including lactose, sucrose, mamiitol, or sorbitol; ceUulose 
preparations such as, for example, maize starch, wheat starch, rice starch, potato starch, gelatin, 

10 gum tragacanth, methyl cellulose, hydroxypropyhnethyl-cellulose, sodium 

carboxymethylcellulose. and/or polyvinylpyrroMone (PVP). If desired, disintegrating agents 
may be added, such as the cross-linked polyvinyl pyrrohdone, agar, or alginic add or a salt 
thereofsuch as sodium alginate. Dragee cores are provided with suitable coatings. For this 
purpose, concentrated sugar solutions may be used, which may optionally contain gum arabic, 

1 5 talc, polyvinyl pyrrolidone, carbopol gel, polyethylene glycol, and/or titanium dioxide, lacquer 
sohitions. and suitable organic solvents or solvent mbrtures: Dyestuffe or pigments may be 
added to the tablets or dragee coatings for identification or to characterize different combinations 

of active compound doses. 

Pharmaceutical preparations which can be used orally include push-fit capsules made of 

20 gelatin,asweUassoft,sealedcapsulesmadeofgelatinandaplastici2er,suchasglycerolor 

sorbitol. The push-fit capsules can contain the active ingredients in admixture with filler such as 
lactose, bmders such as startjhes, and/or lubricants such as talc or magnesimn steatate and, 
optionally, stabilizers. In soft capsules, the active compounds may be dissolved or suspended in 
suitable Uquids, such as fetty oils, liquid paraffin, or liquid polyethylene glycols. In addition, 

25 stabilizers may be added. All formulations for oral administration should be in dosages suitable 
for such administration. For buccal administration, the compositions may take the form of 
tablets or lozenges formulated in conventional manner. 

For administration by inhalation, tire compounds for use according to the present 
invention are convenieutiy deUveted in the form of an aerosol spray presentation from 

30 pressurized packs or a nebuliser, with tiie use of a suitable propellant, e.g., 

dichlorodifluoromefeane. trichlorofluoromethane, dichlorotetrafluoroethane, carbon dioxide or 
other suitable gas. In the case of a pressurized aerosol the dosage unit may be determined by 
providing a valve to deliver a metered amount Capsules and cartridges of, e.g., gelatin for use 
in an inhaler or insufOator may be formulated containing a powder mix of the compound and a 

35 suitablepowderbasesuchaslactoseorstarch. The compounds may be formulated for parenteral 



67 



PCT/USOl/03800 

WO 01/57188 

admiidstration by mjection. e.g. , by bolus injection or continuous infusion. Fonnuktions for 
injection may be presented in unit dosage form. e.g. . in ampules or in multi-dose containers. vn± 
an added preservative. The compositions may take such forms as suspensions, solutions or 
emulsions in oUy or aqueous vehicles, and may contain foimulatory agents such as suspending. 
5 stabilizing and/or dispersing agents. 

Pharmaceutical formulations for parenteral administation include aqueous solutions of 
the active compounds in water-soluble form. Additionally, suspensions of the active compomids 
may be prepared as appropriate oUy injection suspensions. Suitable lipophilic solvents or 
vehicles include fetty oUs such as sesame oil, or synflietic fatty acid esters, such as elhyl oleate or 
10 triglycerides, or liposomes. Aqueous injection suspensions may contain substances which 
increase the viscosity of tiie suspension, such as sodium caiboxymethyl cellulose, sorbitol, or 
dextran. Optionally, the suspension may also contain suitable stabilizers or agents which 
increase tiie solubility of tiie compomKis to allow for tiie preparation of highly concentrated 
solutions. Alternatively, flie active ingredient may be in powder form for constitution witii a 
15 suitable vehicle, e.g., sterile pyrogen-free water, before use. 

Hie compounds may also be formulated in rectal compositions such as suppositories or , 
retention enemas, e.g. . containing conventional suppository bases such as cocoa butter or other 
glycerides. In addition to the formulations described previously, tiie compounds may also be 
foimulatedasadepotpreparation. Such long acting formulations may be administered by 
20 implantation(forexamplesubcutaneouslyorintmmuscularly)orbyintrm 

Thus, for example, tiie compounds may be formulated wifli suitable polymeric or hydrophobic 
materials (for example as an emulsion in an acceptable oil) or ion exchange resins, or as 
sparingly soluble derivatives, for example, as a sparingly soluble salt. 

A pharmaceutical carrier for tiie hydrophobic compounds of tiie invention is a co-solvent 
25 system comprising benzyl alcohol, a nonpolar surfectant. a water-miscible organic polymer, and 
anaqueousphase. lUeco-solvent system may be tiie VPD co-solvent system. VPD is a solution 
of 3% w/v benzyl alcohol, 8% w/v of tiie nonpolar surfactant polysorbate 80. and 65% w/v 
polyefliylene glycol 300. made up to volume in absolute etiianol. The VPD co-solvent system 
(VPD:5W) consists of VPD diluted i:l witii a 5% dextrose in water solutioa This co-solvent 
30 system dissolves hydrophobic compounds well, and itself produces low toxicity upon systemic 
administration. Natiirally. tiie proportions ofa co-solvent system may be varied considerably 
witiiout destroying its solubiUty and toxicity characteristics. Furthermore, tiie identity of tiie 
co-solvent components may be varied: for example, otiier low-toxidty noi^lar surfactants may 
be used instead of polysorbate 80; flie ftaction size of polyetiiylene glycol may be varied; otiier 
35 biocompatiblepolyinersmay replacepolyetiiylene glycol. e.g. polyvinylpyrrolidone and otiier 
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sugars or polysaccharides may substitute for deKtrose. Alternatively, other deUvery systems for 
hydrophobic phannaceutical compounds may be employed. Liposomes and emulsions are well 
known examples of delivery vehicles or earners for hydrophobic drugs. Certain organic solvents 
such as dimethylsulfoxide also may be employed, although usually at the cost of greater toxicity. 
5 AdditionaUy, the compounds may be delivered using a sustained-ielease system, such as 
semipermeable matrices of solid hydrophobic polymers containing the therapeutic agent 
Various types of sustained-release materials have been estabUshed and are wcU known by tiiose 
skilled in the art Sustained-release capsules may, depending on then: chemical nature, release the 
compounds for a few weeks up to over 100 days. Depending on the chemical nature and the 
10 biological stability of the therapeutic reagent, additional strategies for proton oi: otiier active 
ingredient stabilization may be employed. 

•Die pharmaceutical compositions also may comprise suitable soUd or gel phase carriers 
or excipients. Examples of such carriers or excipients include but arc not limited to calcium 
carbonate, calcium phosphate, various sugars, starches. ceUulose derivatives, gelatin, and 
15 polymers such as polyethylene glycols. Many of tiie active ingredients of the invention may be 
provided as salts vnOi pharmaceutically compatible counter ions. Such phaxmaceutically 
acceptable base addition salts are those salts which retain the biological effectiveness and 
properties of the fiee adds and which are obtained by reaction with inorganic or organic bases 
such as sodium hydroxide, magnesium hydroxide, ammonia, Irialkylamine, dialkylamine. 
20 monoalkylamine, dibasic amino acids, sodimn acetate, potassium benzoate, triethanol amine and 
the like. 

Hie pharmaceutical composition of flie invention may be in tiie form of a complex of tiie 
protein(s) or otiier active ingredient(s) of present invention along witii protein or peptide 
antigens.Theproteinand/orpeptideantigen will deUverastimulatory signal tobotiiB and! 

25 lymphocytes. B lymphocytes wiU respond to antigen through tiieir surface immunoglobulin 
receptor. T lymphocytes wiU respond to antigen through the T cell receptor (TCR) foUowing 
presentation of tiie antigen by MHC proteins. MHC and structuraUy related proteins including 
tiiose encoded by class I and class H MHC genes on host cells will serve to present the peptide 
antigen(s) to T lymphocytes. The antigen components could also be supplied as purified 

30 MHC-peptide complexes alone or witii co-stimulatoiy molecules tiiat can directiy signal T ceUs. 
Alternatively antibodies able to bind surface immunoglobulin and other molecules on B cells as 
weU as antibodies able to bind tiie TCR and otiier molecules on T cells can be combined witii tiie 
phannaceutical composition of the invention. 

The phannaceutical composition of tiie invention may be in tiie fomi of a liposome in 

35 . vriiich protein of tiie present invention is combined, in addition to otiier phaxmaceutically 
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acceptable carriers, with amphipathic agents such as Upids which exist in aggregated form as 
miceUes, insoluble monolayers, Uquid crystals, or lamellar layers in aqueous solution. Suitable 
lipids for Hposomal formulation include, without limitation, monoglycerides, diglycerides, 
sulfatides, lysoledthins, phosphoUpids, saponin, bUe acids, and the like. Preparation of such 
5 Uposomal fimnulations is within the level of skill in the art, as disclosed, fot example, m U.S. 
Patent Nos. 4,235,871; 4,501.728; 4,837.028; and 4,737,323, aU of which are incoiporated 

herein by reference. 

The amount of protein or other active mgredient of the present invention in the 
pharmaceutical composition of the present invention will depend upon the nature and severity of 

10 the condition being treated, and on the nature of prior treatments wHch the patient has 

midergone. Ultimately, the attendmg physician will decide the amount of protem or other active 
ingredient of the present invention with which to treat each individual patient Initially, the 
attendmg physician wiU administer low doses of protein or other active mgredient of tiie present 
invention and observe the patient's response. Larger doses of protem or.other active mgredient 

15 ofthepresentinventionmay beadministereduntilthe optimal therapeutic effect is obtained for 
the patient, and at that point the dosage is not mcreased fiirther. It is contemplated that tiie 
various pharmaceutical compositions used to practice the method of the present invention should 
contain about 0.01 ngto about 100 mg (preferably about 0.1 ng to about 10 mg, more preferably 
about 0.1 Jig to about 1 mg) of protein or otiier active ingredient of the present invention per kg 

20 body weight For compositions of the present invention which are useful for bone, cartilage, 
tendon or ligament regeneration, the tiierapeutic method includes administering the composition 
topically, systematically, or locally as an implant or device. When administered, the tiierapeutic 
composition for use in this mvention is, of course, in apyrogen-ftee, physiologically acceptable 
form. Further, tiie composition may desaably be encapsulated or injected in a viscous form for 

25 delivery to the site of bone, cartilage or tissue damage. Topical administration may be suitable 
for wound healing and tissue repair. TherapeuticaUy usefiil agents otiier tiian a protein or otiier 
active mgredient of tiie mvention which may also optionaUy be included in tiie composition as 
described above, may alternatively or additionally, be admmistered simultaneously or 
sequentially wifli flie composition m tiie mefliods of tiie invention. Preferably for bone and/or 

30 cartilage formation, tiie composition would include a matiix capable of deUvering tiie 

protein-containmg or other active ingredient-contaimng composition to tiie site of bone and/or 
cartilage damage, providmg a structure for tiie develoiring bone and cartilage and optimally 
capable of being resorbed mto tiie body. Such matrices may be formed of materials presentiy in 
use for otiier implanted medical applications. 
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The choice of matrix material is based on biocompatibility, biodegradability, mechanical 
properties, cosmetic appearance and mteifece properties. The particular application of the 
compositions will define the appropriate formulation. Potential matrices for the compositions 
may be biodegradable and chemically defined calcium sulfate, tricalcium phosphate, 

5 hydroxyapatite, polylactic acid, polyglycolic acid and polyaohydrides. Other potential materials 
are biodegradable and biologically well-defined, such as bone or dermal collagen. Further 
matrices ate comprised of pure proteins or extracellular matrix components. Other potential 
matrices are nonbiodegradable and chemically defined, such as sintered hydroxyapatite, bioglass, 
aluminates, or other ceramics. Matrices may be comprised of combinations of any of the above 

10 mentioned types of material, such as polylactic acid and hydroxy^atite or collagen and 
tricalcium phosphate. The bioceramics may be altered in composition, such as in 
calcium-aluminate-phosphate and processing to alter pore size, particle size, particle shape, and 
biodegradability. Presentiy preferred is a 50:50 (mole weight) copolymer of lactic acid and 
glycolic acid in tiie form of porous particles having diameters ranging &om 150 to 800 microns. 

15 In some applications, it will be usefiil to utilize a sequestering agent, such as carboxymethyl 
cellulose or autologous blood clot, to prevent the protein compositions fix)m disassociating fit)m 
the matrix. 

A preferred family of sequestering agents is cellulosic materials such as alkylcelluloses 
(including hydroxyalkylcelluloses), including mefliylcellulose, ethylcellulose, 

20 hydroxyethylcellulose, hydroxypropylcellulose, hydroxypropyl-mefliylcellulose, and 

carboxymethylcellulose, the most preferred being cationic salts of carboxymethylcellulose 
(CMC), Other preferred sequestering agents include hyaluronic acid, sodium alginate, 
poly(ethylene glycol), polyoxyethylene oxide, carboxyvinyl polymer and poly(vinyl alcohol). 
The amount of sequestering agent useful herein is 0.5-20 wt %, preferably 1-10 wt % based on 

25 total formulation weight, which represents flie amount necessary to prevent desorption of the 
protein fiom the polymer matrix and to provide appropriate handling of the composition, yet not 
so much that the progenitor cells are prevented firom infiltrating the matrix, tiiereby providing tiie 
protein the opportunity to assist tiie osteogenic activity of tiie progenitor cells. In further 
compositions, proteins orother active ingredients of tiie invention may be combined witii otiier 

30 agents beneficial to the treatment of tiie bone and/or cartilage defect, wound, or tissue in 

question. These agents mclude various growfli factors such as epidermal growtii factor (EGF), 
platelet derived growOi factor (PDGF), transfoiming growtii factors (TGF-a and TGF-P), and 

insulin-like growth fector (IGF). 

The tiierapeutic compositions are also presentiy valuable for veterinary ^plications. 
35 Particularly domestic animals and tiioroughbred horses, in addition to humans, are desired 
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patients for such treatment with proteins or other active ingredients of Hie present invention. The 
dosage regimen of a protein-containing pharmaceutical composition to be used in tissue 
regeneration vdll be detenuincd by the attending physician considering various fectprs which 
modify the action of theprotems, e.g.. amount of tissue weight desired to be formed, the site of 

5 damage, the condition of the damaged tissue, the size of a wound, type of damaged tissue (e.g., 
bone), the patient's age, sex, and diet, the severity of any infection, time of administration and 
other clinical factors. Hie dosage may vary with the type of matrix used in the reconstitution 
and with inclusion of other proteins m the pharmaceutical composition. For example, the 
addition of other known growth fectors, such as IGF I (insulin like growth fector I), to the final 

10 composition, may also efifectthe dosage. Progress can be monitored by periodic assessment of 
tissueA)one growth and/or repair, for example. X-rays, histomorphometric determinations and 

tetracycline labeling. 

Polynucleotides of the present invention can also be used for gene therapy. Such 
polynucleotides can be introduced either in vivo or ex vivo into cells for expression in a 
15 mammaHan subject Polynucleotides of Uie invention may also be administeredby other known 
methods for introduction of nucleic acid into a cell or organism Cmcluding, without limitation, in 
the form of viral vectors or naked DNA). CeUs may also be cultured ex vivo in the presence of 
proteins of the present invention in order to proliferate or to produce a desired effect on or 
activity in such cells. T^ed cells can then be introduced in vivo for therapeutic purposes. 

20 

4.123 EFFECTIVE DOSAGE 

Pharmaceutical compositions suitable for use in the present invention include 
compositions wherein the active ingredients are contained in an effective amount to achieve its 
intended purpose. More specifically, a therapeutically effective amount means an amount 

25 effective to prevent development of or to alleviate the existing symptoms of the subject being 
treated. Deteimination of the effective amount is well within tiie capabiUty of tiiose skilled in 
the art. especially in light of the detailed disclosure provided herein. For any compound used in 
tiie mefliod of the invention, the therapeutically effective dose can be estimated initiaUy from 
appropriate in vitro assays. For example, a dose can be formulated in animal models to achieve a 

30 circulating concentration range that can be used to more accurately deteimine usefiil doses in 
humans. For example, a dose can be formulated in animal models to achieve a circulating 
concentration range that includes the IC50 as detemiined in ceU culture (i.e., the concentration of 
the test compound which achieves a half-maximal inhibition of the protein's biological activity). 
Such information can be used to more accurately determine useful doses m humans. 
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. A therapeuticaUy efifective dose refers to lhat amount of the compound lhat results in 
ameHoration of symptoms or a prolongation of survival in a patient Toxicity and therapeutic 
efficacy of such compounds can be determined by standard pharmaceutical procedures in ceU 
cultures or experimental animals, e.g., for detemuning the LDso (the dose lethal to 50% of the 
5 population) and the EDjo (Ae dose therapeutically efifective in 50% of the population). The dose 
ratio betvifeen toxic and therapeutic effects is the therapeutic index and it can be expressed as the 
ratio between LDjo and EDso. Compounds which exhibit high tiierapeufic indices are preferred. 
The data obtained from these ceU culture assays and animal studies can be used in formulating a 
range of dosage for use in human. The dosage of such compounds Ues preferably within a range 
10 of drculating concentrations that include the EDso with Utile or no toxicity. The dosage may 
vary within this range depending upon the dosage form employed and tiie route of administration 
utilized. The exact formulation, route of administration and dosage can be chosen by the 
individual physician in view of the patient's condition. See, e.g., Fingl et al., 1975, in "The 
Phannacological Basis of Therapeutics". Ch. 1 p.l. Dosage amount and interval may be adjusted 
15 individually to provide plasma levels of the active moiety which are sufficient to maintain the 
desired effects, or minimal effective concentration (MEC). The MEC will vary for each 
compound but can be estimated from in vitro data. Dosages necessary to achieve the MEC wiU 
depend on individual characteristics and route of administration. However, HPLC assays or 
bioassays can be used to determine plasma concentrations. 
20 Dosage intervals can also be determined using MEC value. Compounds should be 

administered using a regimen which maintains plasma levels above tiie MEC for 1 0-90% of the 
• time, preferably between 30-90% and most prefisrably between 50-90%. In cases of local 
administration or selective uptake, the effective local concentration of tiie drug may not be 
related to plasma concentration. 
,25 An exemplary dosage regimen for polypeptides or other compositions of the invention 

WiU be in the range of about 0.01 jig/kg to 100 mg/kg of body weight daily, with the preferred 
dose bemg about 0.1 ng/kg to 25 mg/kg of patient body weight daily, varying in adults and 
children. Dosing may be once daily, or equivalent doses may be delivered at longer or shorter 
intervals. 

30 ' The amount ofcomposition administered WiU, ofcourse, be dependent on the subject 
being treated, on the subject's age and weight, tiie severity of tiie affUction, the manner of 
administration and the judgment of tiie prescribing physician. 
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The compositions may, if desired, be piesented in a pack or dispenser device which may 
contain one or more unit dosage forms containing the active ingredient. The pack may, for 
example, comprise metal or plastic foil, such as a bHster pack. The pack or dispenser device may 
be accompanied by instructions for administration. Compositions comprising a compound of tiie 
5 invention formulated in a compatible phannaceutical carrier may also be prepared, placed in an 
appropriate container, and labeled for treatment of an indicated condition. 



4.13 ANTIBODIES 

Also included in tiie invention are antibodies to proteins, or fragments of proteins of tiie 

10 invention. Ite tma "antibody" as used herein refers to immunoglobulin molecules and 

immunologically active portions of mmmnoglobulin Qg) molecules, ie., molecules that contain 
an antigenbmding site that specifically binds (immunoreacts with) an antigen. Such antibodies 
include, but are not limited to, polyclonal, monoclonal, chimeric, single chain, F«b, Fab- and F(ab02 
fragments, and an F.b expression Ubrary. In general, an antibody molecule obtained from 

15 humans relates to any of the classes IgG. IgM, IgA, IgE and IgD, which differ from one anotiier 
by the nature of the heavy chain present in the molecule. Certain classes have subclasses as well. 
suchasIgG,,IgG2.andotiieis. Furthermore, in humans, the light chain may be a kappa chain or 
a lambda chain. Reference herein to antibodies includes a reference to aU such classes, 
subclasses and types of human antibody species. 

20 An isolated related protein of the invention may be intended to serve as an antigen, or a 

portion or fragment fliereol^ and additionally can be used as an immunogen to generate 
antibodies tiiat immunospecifically bind tiie antigen, using standard techniques for polyclonal 
and monoclonal antibody preparation. The foil-length protein can be used or, alternatively, tiie 
invention provides antigenic peptide fragments of tiie antigen for use as immunogens. An 

25 antigenic peptide fragment comprises at least 6 amino acid residues of tiie amino acid sequence 
of flic fuU lengfli protein, (for example tiie amino acid sequence shown in SEQ ID NO: 1351), 
and encompasses an epitope tiiereof such tiiat an antibody raised against tiie peptide forms a 
specific immune complex witii tiie fiiU lengfli protein or witii any fragment tiiat contains tiie 
epitope. Preferably, flie antigenic peptide comprises at least 10 amino acid residues, or at least 

30 15 ammo acid residues, or at least 20 amino acid residues, or at least 30 amino acid residues. 
Preferred epitopes encompassed by tiie antigenic peptide are regions of tiie protein fliat are 
located on its surfece; commonly tiiese are hydrophilic regions. 

In certain embodiments of tiie invention, at least one epitope encompassed by tiie 
antigenic peptide is a region of -related protein fliat is located on flie surfece of flie protein, e.g., a 

35 hydrophilic region. A hydrophobicity analysis of tiie human related protein sequence wiU 
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indicate wbich regions of a related protein are particularly hydrophiUc and, therefore, are likely 
. to encode surfece residues useful for targeting antibody productioiL As a means for targeting 

antibody production, hydropathy plots showing regions of hydrophiUcily and hydrophobicity 

may be generated by any method well known in the art, including, for example, the Kyte 
5 DooHtae or the Hopp Woods methods, either with or without Fourier transformation. See, e.g., 

Hopp and Woods, 1981, Proc. Nat Acad. Set USA 78: 3824-3828; Kyte and Doolittle 1982. J. 

Mol Biol. 157: 105-142, each of which is incorporated herein by reference in its entirety. 

Antibodies that are specific for one or more domains within an antigenic protein, or derivatives, 

fragments, analogs or homologs thereof, are also provided herein. 
10 A protein of the invention, or a derivative, fragment, analog, homolog or orlholog 

thereof, may be utilized as an immunogen in the generation of antibodies tiiat 

immunospecifically bind these protein components. 

Various procedures known withm the ait may be used for the production of polyclonal or 

monoclonal antibodies directed against a protein of the invention, or against derivatives, 
15 fragments, analogs homologs or orthologs thereof (see, for example. Antibodies: A Laboratory 

Manual, Harlow E, and Lane D. 1988, Cold Spring Harfjor Laboratory Press, Cold Spring 

Harbor. NY, incorporated herein by reference). Some of these antibodies are discussed below. 

5.13.1 Polyclonal Antibodies 

20 For the production of polyclonal antibodies, various suitable host animals {e.g., rabbit, 

goat, mouse or other mammal) may be immunized by one or more injections with the native 
protein, a synthetic variant thereof, or a derivative of the foregoing. An appropriate 
immunogenic preparation can contain, for example, the naturaUy occurring immunogenic 
protein, a chemicaUy synthesized polypeptide representing the immunogenic protein, or a 

25 recombinantly expressed immmiogenic protein. Furthennore, the protein may be conjugated to 
a second protein knownto be immunogenic in the mammal being immunized. Examples of such 
immunogenic proteins include but are not limited to keyhole limpet hemocyanin, serum albumin, 
bovine thyroglobulin, and soybean trypsin inhibitor. Hie preparation can forther include an 
adjuvant Various adjuvants used to increase the immunological response include, but are not 

30 limited to, Freund's (complete and incomplete), mineral gels (e.g., aluminum hydroxide), surface 
active substances (eg., lysolecithin. pluronic polyols, polyanions, peptides, oil emulsions, 
dinitrophenol, ete.), adjuvants usable in humans such as Bacille Calmette-Guerin and 
Corynebacterium parvum. or similar immunostimulatory agents. Additional examples of 
adjuvants which can be employed include MPL-TDM adjuvant (monophosphoryl Lipid A, 

35 synthetic trdialosedicorynomycolate). 
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The polyclonal antibody molecules directed against tbe immunogenic protein can be 
isolated from the mammal (e.g. , from the blood) and ftirther purified by weU known techniques, 
such as afSnity chromatography using protein A or protein G, which provide primarily the IgG 
fraction of immune serum. Subsequently, or alternatively, the specific antigen wbich is the 
5 target of the unmunoglobulin sought, or an epitope thereof; may be immobilized on a column to 
purify the immune specific antibody by immunoafiBnitydiromatography. Purification of 
immunoglobulins is discussed, for example, by D. Wilkinson (The Scientist, published by The 
Scientist, Inc., Philadelphia PA, Vol. 14, No. 8 (April 17, 2000), pp. 25-28). 

10 5.13.2 Monoclonal Antibodies 

The term "monoclonal antibody" (MAb) or "monoclonal antibody composition", as used 
herein, refers to a population of antibody molecules that contain only one molecular species of 
antibody molecule consisting of aunique Ught chain gene product and a unique heavy cham 
gene product In particular, the complementarity determining regions (CDRs) of the monoclonal 

15 antibody are identical in all the molecules of the populatioa MAbs thus contain an antigen 
binding site capable of immunoreacting with a particular epitope of the antigen characterized by 
a unique binding afSnify for it. 

Monoclonal antibodies can be prepared using hybridoma methods, such as those 
described by Kohler and Milstein, Nature. 256:495 (1975). In a hybridoma method, a mouse, 

20 hamster, or other appropriate host animal, is typically immunized with an immunizing agent to 
elicit lymphocytes that produce or are capable of producing antibodies that will specificaUy bind 
to the immunizing agent. Ahematively, the lymphocytes can be immunized in vitro. 

The immnniTing agent will typically include the protein antigen, a fiagment thereof or a 
fiision protein liiereof Generally, either peripheral blood lymphocytes are used if cells of human 

25 origin are desired, or spleen cells or lymph node cells are used if non-human mammalian sources 
are desired. The lymphocytes are then fused with an immortalized cell line using a suitable 
fusmg agent, such as polyethylene glycol, to form a hybridoma ceU (Coding, Monoclonal 
Antibodies: Principle-^ and PracHce. Academic Press, (1986) pp. 59-103). ImmortaUzed ceU 
lines are usually transformed mammalian cells, particularly myeloma ceUs of rodent, bovine and 

30 human origin. Usually, rat or mouse myeloma cell lines are employed. The hybridoma cells can 
be cultured in a suitable culture medium that preferably contains one or more substances that 
inhibit the growth or survival of die un&sed, immortalized cells. For example, if the parental 
cells lack the enzyme hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the 
culture medium for the hybridomas typicaUy will include hypoxanthine, aminopterin, and 

35 thymidine CHAT medium"), which substances prevent the growth of HGPRT-defident cells. 
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Preferred immortalized cell lines are those that fuse efficiently, support stable high level 
expression of antibody by the selected antibody-producing cells, and are sensitive to a medium 
such as HAT medium. More prefened immortalized ceU lines are murine myeloma lines, which 
canbe obtained, for instance, from the Salk Institute CeU Distribution Center. San Diego, 
CaliforniaandAeAmericanTVpe Culture CoUection, Manassas. Virginia. Human myeloma and 
mouse-human heteromyeloma ceU lines also have been described for the production of human 
monoclonal antibodies (Kozbor. L^mufioL J3i:3001 (1984); Brodeur et al.. MSS2SM 
Antihndv Productinii Tftchnianes and Applications. Maicel Dekker, Inc.. New York, (1987) pp. 

51-63). 

Hie culture medium in which the hybridoma cdls are cultured can then be assayed for 
the presence of monoclonal antibodies directed againstthe antigen. Preferably, the binding 
specificity of monoclonal antibodies produced by the hybridoma cells is determined by 
immunoprecipitation or by an m vitro bindmg assay, such as radioimmunoassay (RIA) or 
enzym^linked immunoabsorbent assay (ELISA). Such techniques and assays are known in the 
art. Hie binding affinity of the monoclonal antibody can, for example, be detennined by the 
Scatchard analysis of Munson and Pollard, Angl. BiQchem. 152:220 (1980). Preferably, 
antibodies havingaMgh degree of specificity andaWgh bindmg affinity for the target antig^ 

are isolated. 

After the desired hybridoma cells are identified, the clones can be subcloned by limiting 
dilution procedures and grown by standard methods. Suitable culture media for this purpose 
include, for example, Dulbecco's Modified Eagle's Medium and RPMI-1640 medium. 
Alternatively, the hybridoma cells can be grown in vivo as ascites in a mammal. 

The monoclonal antibodies secreted by the subclones can be isolated or purified from the 
culture medium or ascites fluid by conventional immunoglobulin purification procedures such 
as, for example, protein A-Sepharose, hydroxylapatite chromatography, gel electrophoresis, 

dialysis, or affinity diromatogr^y. 

The monoclonal antibodies can also be made by recombinant DNA methods, such as 
those described in U.S. Patent No. 4,816,567. DNA encoding the monoclonal antibodies of the 
invention can be readily isolated and sequenced usmg conventional procedures {e.g., by using 
oUgonucleotide probes that are capable of bmding specifically to genes encoding the heavy and 
Ught chains of murine antibodies). The hybridoma cells of the imrention serve as a prefened 
source of such DNA. Once isolated, the DNA can be placed into expression vectors, which are 
then transfected into host cells such as shnian COS ceUs, Chinese hamster ovary (CHO) ceUs. or 
myeloma cells that do not otherwise produce immunoglobulin protein, tx) obtain the synthesis of 
monoclonal antibodies in the recombinant host cells. The DNA also can be modified, for 
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example, by substituting the coding sequence for human heavy and Ugbt chain constant domains 
in place of the homologous murine sequences (U.S. PatentNo. 4,816^67; Morrison, Jtoffe 268, 
812-13 (1994)) or by covalently joining to the unmunoglobulin coding sequence all or part of the 
coding sequence for a non-immunoglobulin polypeptide. Such a non-immunoglobulin 
5 polypeptide can be substituted for the constant domains ofan antibody ofthe invention, or can 

be substituted for the variable domains of one antigen-combining site ofan antibody ofthe 
invention to create a chimeric bivalent antibody. 

5.13^ Humanized Antibodies 

10 nie antibodies directed against the protein antigens of the invention can further comprise 

humanized antibodies or human antibodies. These antibodies are suitable for administration to 
humans without engendering an immune response by tiie human against the administered 
inmunoglobulin. Humani28d forms of antibodies are chimeric immunoglobulins, 

immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', m'h <« other antigen- 

15 binding subsequences of antibodies) that are principally comprised ofthe sequence of ahuman 
immunoglobulin, and contam minimal sequence derived from a non-human unmunoglobulin. 
Humanization can be peifonned following themefliod of Winter and co-workers (Jones et al., 
Nature. 321:522-525 (1986); Riechmann et al.. Nature, 332:323-327 (1988); Verhoeyen et al.. 
Science. 222:1534-1536 (1988)). by substituting rodent CDRs or CDR sequences for tiie 

20 coiiespondingsequencesofahumanantibody.(SeealsoU.S.PatentNo.5,225,539.) Insome 
instances, Fv ftamewoik residues ofthe human immunoglobulm are replaced by conesponding 
non-human residues. Humanized antibodies can also comprise residues which are found neither 
in the recipient antibody nor in the hnported CDR or framework sequences. In general, the 
humanized antibody wiU comprise substantially all of at least one, and typically two, variable 

25 domains, in which all or substantially all of flie CDR regions correspond to tiiose of a non-human 
inmiunoglobulin and aU or substantially all of the framework regions are those of a human 
immunoglobulin consensus sequence. The humanized antibody optimaUy also wiU comprise at 
least a portion ofan immunoglobulin constant region (Fc), typically tiiat of a human 
immunoglobulin (Jones et al., 1986; Riechmann et al., 1988; and Presta, C»rr. Op. Struct. Biol., 

30 2:593-596(1992)). 

5.13.3 Human Antibodies 

Fufly human antibodies relate to antibody molecules in which essentially the entire 
sequences of both the light chain and the heavy chain, including the CDRs, arise from human 
35 genes. Such antibodies are termed "human antibodies", or "fully human antibodies" herein. 
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Human monoclonal antibodies can be prepared by tbe trioma technique; the human B-ceU 
hybridoma technique (see Kozbor, et al., 1983 Immunol Today 4: 72) and the EBV hybridoma 
technique to produce human monoclonal antibodies (see Cole, et al., 1985 In: Monoclonal 
ANTIBODIES AND CANCER THERAPY, Alan R. Liss, Inc., pp. 77-96). Human monoclonal 

5 antibodies may be utilized in the practice of the present invention and may be produced by using 
human hybiidomas (see Cote, et al., 1983. ProcNatl Acad Sci USA 80: 2026-2030) or by 
transforming human B-ceUs with Epstein Barr Vims in vitro (see Cole, et al., 1985 In: 
Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). 

In addition, human antibodies can also be produced using additional techniques, 

10 including phage display libraries (Hoogenboom and Winter, I MqI. Bwl, 222:381 (1991); 
Marks et al., J.Mol.Biol.. ^:581 (1991)). Similarly, human antibodies can be made by 
introducing human immunoglobulin loci into transgaiic animals, e.g., mice in vkiiich the 
endogenous immunoglobulin genes have been partially or completely inactivated. Upon 
challenge, human antibody production is observed, which closely resembles tiiat seen in humans 

15 in aU respects, including gene leanangement, assembly, and antibody repertoire. This approach 
is described, for example, in U.S. Patent Nos. 5,545,807; 5,545,806; 5,569,825; 5.625,126; 
5,633.425; 5,661,016, and in Marks et al. (Bio/Technologv 10. 779-783 (1992)); Lonberg et al. 
fl^amre 368 856-859 (1994)); Morrison ( Nature m 812-13 (1994)); Fishwild et al,( Nafiuie 
Biotechnology 14, 845-51 (1996)); Neuberger rNfahire Biotechnology 14. 826 (1996)); and 



20 Lonbag and Huszar antem. Rev. Tmmimnl. 13 65-93 (1 995)). 

Human antibodies may additionally be produced using transgenic nonhuman animals 
which are modified so as to produce fully human antibodies rather than tiie animal's endogenous 
antibodies in response to challenge by an antigen. (See PCT pubKcation WO94/02602). The 
endogenous genes encoding the heavy and light immunoglobulin chains in tiie nonhuman host 

25 have beoi incapacitated, and active loci encoding human heavy and tight chain unmunoglobulins 
arc inserted into tiie host's genome. The human genes are incorporated, for example, using yeast 
artificial chromosomes containing the requisite human DNA segments. An animal which 
provides all tiie desired modifications is tiien obtained as progeny by crossbreeding intermediate 
transgeidc animals containing fewa: tiian the full complement of tiie modifications. The 

30 preferred embodimait of such a nonhuman animal is a mouse, and is termed the Xenomouse 
as disclosed in PCT publications WO 96/33735 and WO 96/34096. This animal produces B 
cells which secrete fully human immunoglobuhns. The antibodies can be obtained directiy firom 
tiie' animal after nnmunization witii an immunogen of interest, as, for example, a preparation of a 
polyclonal antibody, or alternatively fiom immortalized B cells derived fixjm tiie animal, such as 

35 hybridomas producing monoclonal antibodies. Additionally, tiie genes encoding tiie 
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immunoglobulins with Iniman variable regions can be recovered and expressed to obtain the 
antibodies directly, or can be further modified to obtain analogs of antibodies such as, for 
example, single chain Fv molecules. 

An example of a method of producing a nonhuman host, exemplified as a mouse, lacking 

5 expression of an endogenous immunoglobulin heavy chain is disclosed in U.S. Patent No. 
5,939,598. It can be obtained by a method including deleting the J segment genes fi:om at least 
one endogenous heavy chain locus in an embryonic stem cell to prevent rearrangement of the 
locus and to prevent formation of a transcr^ of a rearranged unmunoglobulm heavy chain locus, 
the deletion being effected by a targeting vector containmg a gene encodmg a selectable marker; 

10 and producing from the emhtyonic stem cell a transgenic mouse whose somatic and germ cells 
contain the gene encoding the selectable marker. 

A method for prbducmg an antibody of interest, such as a human antibody, is disclosed in 
U.S. Patent No. 5,916,771. It includes introducing an expression vector that contains a 
nucleotide sequence encodmg a heavy chain mto one mammalian host cell m culture, introducing 

15 an expression vector containing a nucleotide sequence encoding a light chain into aiiother 

mammaUan host ceU, and fiising the two ceUs to form a hybrid ceU. The hybrid ceU expresses an 
antibody containing the heavy cham and the light chain. 

hi a further unprovement on this procedure, a method for identifying a clinically relevant 
epitope on an immunogen, and a correlative method for selecting an antibody that binds 

20 immunospedfically to the relevant epitope with high afiSnity, are disclosed in PCT pubHcation 
WO99/53049. 

5.13.4 F.b Fragments and Single Chahi Antibodies 

According to the invention, techniques can be adapted for die production of single-chain 
25 antibodies specific to an antigenic protein of the mvention (see e.g., U.S. Patent No. 4,946,778). 
In addition, methods can be ad^d for the construction of Fab expression libraries (see e.g., 
Huse, et al., 1989 Sdence 246: 1275-1281) to aUow rapid and effective identification of 
monoclonal Fa, fragments with the desired specificity for a protein or derivatives, firagments, 
analogs or homologs tiiereof. Antibody fragments tiiat contain the idiotypes to a protein antigen 
30 may be produced by techniques known in tiie art including, but not lunited to: (i) an.F(ab72 

fragment produced by pepsm digestion of an antibody molecule; (ii) an Fab fragment generated 
by leducmg the disulfide bridges of an V(am fragment; (iii) an Fab fragment generated by the 
treabnent of tiie antibody molecule wifli papam and a reducmg agent and (iv) Fv fragments. 

35 5.13.5 Bispedfic Antibodi s 
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Bispedfic antibodies are monoclonal, preferably hiinan or humanized, antibodies that 
have binding specificities for at least two different antigens. In the present case, one of the 
binding specificities is for an antigenic protein of the invention. The second binding target is any 
olher antigen, and advantageously is a ceU-surfece protein or receptor or receptor subunit 
5 Methods for making bispedfic antibodies are known in the art. Traditionally, the 

recombinant production of bispecific antibodies is based on the co-expression of two 
immunoglobulin heavy-diain/Ught-chain pairs, where tiie two heavy chains have different 
spedfidties (Milstdn and Cuello, ^ms. 305:537-539 (1983)). Because of the random 
assortment of immunoglobulin heavy and Ught chains, these hybridomas (quadromas) produce a 
10 potential mixture often different antibody molecules, ofwhich only one has the correct 

bispedfic structure. The purification of the correct molecule is usually accomplished by affinity 
chromatography steps. Similar procedures are disclosed in WO 93/08829. pubUshed 13 May 
1993, and inTraunecker etal., 1991 EMBOJ., 10:3655-3659. 

Antibody variable domains vwtii the desfaed binding specificities (antibody-antigen 
15 combining sites) can be fused to immunoglobulin constant domain sequences. The fiision 

preferably is witii an immunoglobulin heavy-chain constant domain, comprising at least part of 
the hinge. CH2, and CH3 regions. It is preferred to have tiie first heavy-cham constant region 
(CHI) containing tiie site necessary for Ught-chain binding present in at least one of the fusions. 
DNAs encoding the immunoglobulin heavy-diain fusions and, if desired, the immunoglobulin 
20 light chain, are inserted into sfcparate expression vectors, and are co-transfected into a suitable 
host organism. For fiirfher details of generating bispecific antibodies see. for example, Suresh et 
al.. Methods in F.nzvmologv. 121:210 (1986). 

According to another approadi described in WO 96/2701 1, tiie interface between a pair 
of antibody molecules can be engineered to maximize tiie percentage of heteiodimers which are 
25 recovered from recombinant cell cultoire. The preferred interfece comprises at least apart of tiie 
CH3 region of an antibody constant domain. Intiiis metiiod, one or more small amino acid side 
chains fiom flie interfece of tiie first antibody molecule are replaced witii larger side chains (e.g. 
tyrosine or tryptophan). Compensatory "cavities" of identical or similar size to tiie large side 
chain(s) are created on die interface of the second antibody molecule by replacing large amino 
30 acid side chains witii smaUer ones (e.g. alanine or tiireonme). This provides a medianism for 
increasing tiie yidd of tiie heterodimer over otiier unwanted end-products such as homodimers. 

Bispecific antibodies can be prepared as full lengtii antibodies or antibody fragments (e.g. 
F(ab')2 bispecific antibodies). Techniques for generating bispecific antibodies from antibody 
fragments have been described in tiie literatiire. For example, bispedfic antibodies can be 
35 prepared usmg chemical linkage. Brennanetal.,Sd^ 229:81 (1985) describe a procedure 
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wherein intact antibodies are proteolytically cleaved to generate F(ab')2 fragments. These 
fragments are reduced in the presence of the dithiol complexuig agent sodium arsenite to 
stabilize vicinal dithiols and prevent intermolecular disulfide formation. The Fab' fragments 
generated are then converted to thionitrobenzoate (TNB) derivatives. One of the Fab'-TNB 

5 derivatives is then reconverted to the Fab'-thiol by reduction with mercaptoethylamine and is 
mixed with an equimolar amount of the other Fab'-TNB derivative to form tiiie bispecific 
antibody. The bispecific antibodies produced can be used as agents for the selective 
immobilization of enzymes. 

Additionally, Fab' fragments can be directly recovered from E. coli and chemically 

10 coupled to form bispecific antibodies. Shalaby et al., J. Exp. Med. 175:217-225 (1992) describe 
the production of a fully humanized bispecific antibody F(ab')2 molecule. Each Fab' fi:agment 
was separately secreted from K coU and subjected to directed chemical coupling in vitro to form 
the bispecific antibody. The bispecific antibody thus formed was able to bind to cells 
overexpressing the ErbB2 receptor and normal human T cells, as weU as trigger the lytic activity 

1 5 of human cytotoxic lymphocytes against human breast tumor targets. 

Various techniques for making and isolating bispecific antibody fragments directly from 
recombinant cell culture have also been described. For example, bispecific antibodies have been 
produced using leucme zippers. Kosteky et al., J. Immunol. 1 48(5): 1 547-1 553 (1 992). The 
leucine zipper peptides from the Fos and Jun proteins were linked to the Fab' portions of two 

20 different antibodies by gene fusion. The antibody homodimers were reduced at the hinge region 
to form monomers and then re-oxidized to form the antibody heterodimers. This method can 
also be utilized for the production of antibody homodimers. The "diabody" technology 
described by Hollinger et al., Prnc.Natl. Acad. Sci. USA 90:6444-6448 (1993) has provided an 
alternative mechanism for making bispecific antibody fi^agments. The fragments comprise a 

25 heavy-chain variable domain (Vh) connected to a light-chain variable domain (Vl) by a linker 
which is too short to allow pairing between the two domains on the same chain. Accordingly, 
the Vh and Vl domains of one fragment are forced to pair with the complementary Vl and Vh 
domains of another fragment, thereby forming two antigen-binding sites. Another strategy for 
making bispecific antibody fragments by the use of single-chain Fv (sFv) dimers has also been 

30 reported. See, Gruber et al., J. Immunol. 1 52:5368 (1 994). 

Antibodies with more than two valencies are contemplated. For example, trispecific 
antibodies can be prepared. Tutt et al., J. Immunol. 147:60 (1991), 
Exemplary bispecific antibodies can bind to two different epitopes, at least one of which 
origuiates in the protein antigen of the invention. Alternatively, an anti-antigenic aim of an 

35 immunoglobulin molecule can be combined with an arm which binds to a triggering molecule on 
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a leukocyte such as a T-ceU receptor molecule (e.g. CD2, CD3, CD28, or B7). or Fc receptors for 
IgG(Fc R),sudiasFc RI(CD64),Fc RII (CD32) and Fc RIH (CD16) so as to focus 
ceUular defense mechanisms to the ceU expressing the particular antigen. Bispecific antibodies 
can also be used to direct cytotoxic agents to cells which express a particular antigen. These 
5 antibodies possess an antigen-binding aim and an arm which binds a cytotoxic agent or a 

radionucUde chelator, such as EOTUBE, DPTA, DOTA, or TETA. Anoflier bispedfic antibody 
of interest binds the protein antigen described herein and further binds tissue fector (TF). 

S.13.6 Heteioconjugate Antibodies 

1 0 Heteroconjugate antibodies are also within flie scope of tiie present inventioa 

Heteroconjugate antibodies are composed of two covalentiy joined antibodies. Such antibodies 
have, for example, been proposed to target immune system cells to unwanted cells (U.S. Patent 
No. 4.676,980), and for treatment of HIV infection (WO 91/00360; WO 92/200373; EP 03089). 
It is contemplated that the antibodies can be prepared in vitro using known melhods in syntiietic 

15 protein chemistry, including tiiose involving crosslinking agents. For example, immunotoxins 
can be constracted using a disulfide exchange reaction or by forming athioetiier bond. 
Examples of suitable reagents for this purpose include iminothiolate and methyl-4- 
mercaptobutyrimidate and fliose disclosed, for example, in U.S. Patent No. 4,676,980. 

20 5.13.7 Effector Function Engineering 

It can be desirable to modify tiie antibody of die invention wifli respect to effector function, so as 
to enhance, e.g.y die efectiveness of the antibody in treating cancer. For example, cysteine 
residue(s) can be introduced into tiie Fc region, tiiereby allowing interchain disulfide bond 
formation in this re^on. The homodimeric antibody tiius generated can have improved 

25 internalization capability and/or increased complement-mediated ceU killing and antibody- 
dependent cellular cytotoxicity (ADCC). See Caron et al., J. Exp Med, 176: 1191-1 195 (1992) 
and Shopes. J. Immunol., 148: 2918-2922 (1992).' Homodimeric antibodies witii enhanced anti- 
tumor activity can also be prepared using heterobiftinctional cross-linkers as described in Wolff 
etal. Caucer Research, 53: 2560-2565 (1993). Alternatively, an antibody can be engineered tiiat 

30 has dual Fc regions and can tiiereby have enhanced complement lysis and ADCC capabilities. 
See Stevenson et al., Anti-Cancer Drug Design, 3: 219-230 (1989). 

5.13.8 Immunoconjugates 

The invention also pertains to immunoconjugates comprising an antibody conjugated to a 
35 cytotoxic agent such as a chemotiierapeutic agent, toxin (e.g., an enaymatically active toxin of 
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bacterial, fiingal, plant, or animal origin, or fragments thereof), or a mdioactive isotope (i.e. , a 
ladioconjv^ate). 

Chemotherapeutic agents useful in Uie generation of such immunoconjugates have been 
described above. Enzymatically active toxins and fragments thereofthat can be used include 
5 dphtheriaAchain,nonbindingactivefragmentsofdiphtberiatoxin.exotoxinAch^ 
Pseudomonas aeruginosa), ricin A chain, abrin A chain, modeccin A chain, alpha-sarcin, 
Aleurites fordii proteins, diantWn proteins. Phytolaca americana proteins (PAPl, PAPH, and 
PAP-S), momordica charantia inhibitor, curcin, crotin, sapaonaria officinalis inhibitor, gelomn, 
mitogelUn, restrictocin, phenomydn, enomycin, and the tricolhecenes. A variety of 
10 radionuclides are available for the production of ladioconjugated antibodies. Examples include 
212BJ uij iJijjj 90Y^ and"*Re. 

Conjugates of the antibody and cytotoxic agent are made using a variety of bifimctional 
protein-coupling agents such as N-succinimidyi-3-(2-pyridyldithiol) propionate (SPDP), 
iminothiolane (IT), bifunctional derivatives of imidoesters (such as dimethyl adipimidateHCL). 
15 active esters (such as disuccinimidyl suberate), aldehydes (such as glutaieldehyde), bis^do 
compounds (such as bis (p-azidobenzoyi) hexanfediamine). bis-diazonium derivatives (such as 
bis^azoniumben2oyl>ethylenediamine), diisocyanates (such as tolyene 2.6.diisocyanate), 
and bis-active fluorine compounds (such as 1.5-difluoro-2.4^trobenzene). For example, a 
ricin immunotoxin can be prepared as described in Vitetta et al.. Science. 238: 1098 (1987). 
20 Cari3on.l4-labeled l.isothiocyanatobenzyl-3-methyldiethylene triaminepentaacetic acid (MX- 
DTPA) is an exemplary chelating agent for conjugation of radionucleotide to the antibody. See 
WO94/11026. 

In another embodiment, the antibody can be conjugated to a "receptor" (such 
streptavidin) for utilization in tumor pretargetingwherem the antibody-receptor conjugate is 
25 administered to the patient, followed by removal of unbound conjugate from the circulation 
using a clearing agent and then administration of a "Ugand" (e.g., avidin) tiiat is intum 
conjugated to a cytotoxic i^ent 

4.14 COMPUTER READABLE SEQUENCES 
30 In one appUcation of this embodiment, a nucleotide sequence of the present invention can 

be recorded on computer readable media. As used herein, "computer readable media" refers to 
any medium v^hichcanbe read and accessed directiybyacomputer. Such media include, but 

are not limited to: magnetic storage media, such as floppy discs, hard disc storage medium, and 
magnetic tape; optical storage media such as CD-ROM; electrical storage media such as RAM 
35 andROM;andhybridsoftiiesecategoriessuchasmagnetic/opticalstoragemedia. Askilled 
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artisan can readily appreciate how any of the presently known computer readable mediums can 
be used to create a manufacture comprising computer readable medium having recorded thereon 
a nucleotide sequence of Ihe present invention. As used herein, "recorded" refers to a process for 
storing mformation on computer readable medium. A skilled artisan can readily adopt any of the 

5 presently known methods for recording information on computer readable medium to generate 
manufactures comprising the nucleotide sequence mformation of the present invention. 

A variety of data storage structures are available to a skilled artisan for creating a 
computer readable medium having recorded thereon a nucleotide sequence of the present 
invention. The choice of the data storage structure will generally be based on the means chosen 

1 0 to access tiie stored information. In addition, a variety of data processor programs and formats 
cau be used to store the nucleotide sequence mformation of flie present invention on computer 
readable medium. The sequence information can be represented in a word processing text file, 
formatted in commercially-available software such as WordPerfect and Microsoft Word, or 
represented m the form of an ASCII file, stored in a database application, such as DB2, Sybase, 

1 5 Oracle, or the like, A skilled artisan can readily adapt any number of data processor structuring 
formats (e.g. text file or database) in order to obtain computer readable medium having recorded 
thereon the nucleotide sequence information of the present invention. 

By providing any of the nucleotide seqiiences SEQ ID NO:1-1350 or a representative 
fiagment tiiereof; or a nucleotide sequence at least 95% identical to any of the nucleotide 

20 sequences of SEQ ID NO: 1-1350 m computer readable form, a skilled artisan can routinely 
access the sequence infomiation for a variety of purposes. Computer software is publicly 
available which allows a skilled artisan to access sequence information provided in a computer 
readable medium. The examples Mdiich follow demonstrate how software which implements tiie 
BLAST (Altschul et al„ J. MoL Biol. 215:403-410 (1990)) and BLAZE (Brutlag et al., Comp. 

25 Chem. 17:203-207 (1993)) search algoriflmis on a Sybase system is used to identify open reading 
fi-ames (ORFs) within a nucleic acid sequence. Such ORFs may be protein encoding firagments 
and may be useful in producing commercially important proteins such as enzymes used in 
fennentation reactions and in the production of commercially usefiil metabolites. 

As used herein, "a computer-based system" refers to the hardware means, software 

30 means, and data storage means used to analyze the nucleotide sequence infonnation of the 

present invention. The mmimum hardware means of the computer-based systems of the present 
invention comprises a central processing unit (CPU), input means, output means, and data 
storage means. A skilled artisan can readily appreciate that any one of tiie currentiy available 
computer-based systems are suitable for use in the present invention. As stated above, the 

35 computer-based systems of the present mvention comprise a data storage means havmg stored 
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softwaremeansforsupportingandiinplementingasearchmeans. As used herran, "data storage 
means" refers to memory which can store nucleotide sequence information of the present 
invention, or a memory access means which can access manufactures having recorded thereon 
the nucleotide sequence information of the present invention. 

As used herein, "search means" refers to one or more programs which are implemented 
on the computer-based system to compare atarget sequence or target structural motif with the 
sequenceinformationstoredvwthinthedatastomgemeans. Search means are used to identify 
fragments or regions of aknown sequence which match a particular target sequence or target 
motif. A variety of known algoritiuns are disclosed pubUcly and a variety of commercially 
available software for conducting search means are and can be used in the computer-based 
systems of the present mvention. Examples of such software includes, but is not limited to. 
Smith-Watemian. MacPattem (EMBL). BLASTO and BLASTA (NPOLYPEPTTDEIA). A 
skilled artisan can readily recognize tiiat any one of the available algorithms or implementing 
software packages for conducting homology searches can be adapted for use in tiie present 
computer-based systems. As used herein, a "target sequence" can be any nucleic acid or amino 
acid sequence of six or more nucleotides or two or more amino acids. A skilled artisan can 
readUy recognize tiiat a>e longer a target sequence is. tiie less likely a target sequence will be 
present asarandom occurrence in the database. The most preferred sequence length ofatarget 

sequence is from about 10 to 300 amino acids, more preferably from about 30 to 100 nucleotide 
residues. However, it is well recognized that searches for commercially important fragments, 
such as sequence fiagments involved m gene expression and protein processing, may be of 
shorter length. 

As used herein, "a target structural motif," or "target motif." refers to any rationally 
selected sequence or combination of sequences in which tiie sequence(s) are chosen based on a 
tiiree-dimensional configuration which is formed upon tiie folding of the target motif. TTiere are 
a variety of target motifs known in tiie art. Protein target motifs include, but are not limited to. 
enzyme active sites and signal sequences. Nucleic acid target motifs include, but are not limited 
to, promoter sequences, hairpin structures and inducible expression elements (protein binding 



4,15 TRIPLE HELIX FORMATION 

In addition, tiie fragments of tiie present invention, as broadly described, can be used to 
control gene expression flirough triple helix fomiation or antisense DNA or RNA. botii of which 
metiiods are based on flie binding of a polynucleotide sequence to DNA or RNA. ' 
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Polynudeotides suitable for use to these methods are preferably 20 to 
designed to be complementaiy to a region of the gene involved in transcription (triple helix - see 
Lee et al., Nucl. Adds Res. 6:3073 (1979); Cooney et al.. Science 15241 :456 (1988); and Dervan 
et al.. Science 251:1360 (1991)) or to the mRlSIA itself (antisense - Olmno, J. Neurodiem. 

5 56:560 (1991); OUgodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, 
Boca Raton, FL (1988)). Triple helix-formation optimally results in a shut-off of RNA 
transcription from DNA. while antisense RNA hybridization blocks translation of an mRNA 
molecule into polypeptide. Both tedmiques have been demonstrated to be effective in model 
systems. Information contained in the sequences of the present invention is necessary for the 

10 design ofan antisense or triple helix oligonucleotide. 

4.16 DIAGNOSTIC ASSAYS AND KITS 

The present invention further provides methods to identify the presence or expression of 
one of the ORFs of the present invention, or homolog thereof, in a test sample, using a nucleic 
15 acidprobeorantibodiesofthepn:smtinvention,optionallyconjugaledorotherw^ 

vnth a suitable label. 

In general, methods for detecting a polynudeotide of the invention can comprise 
contacting a sample vnth a compound that binds to and forms a complex with the polynucleotide 
for a period sufficient to form the complex, and detecting the complex, so that if a complex is 
20 detected,apolynudeotideoftheinventionisdetectedin1hesample. Such methods can also 
comprise contacting a sample under stringent hybridization conditions with nucleic acid primers 
that amieal to a polynucleotide of the invention under such conditions, and amplifying annealed 
polynucleotides, so that if a polynucleotide is amplified, a polynucleotide of the invention is 
detected in the sample. 

25 In general, methods for detecting a polypeptide of the invention can comprise contacting 

a sample with a compound that binds to and forms a complex with the polypeptide for a period 
sufficient to form the complex, and detecting the complex, so that if a complex is detected, a 
polypeptide of ihe invention is detected in &e sample. 

In detail, such methods comprise incubating a test sample with one or more of the 

30 antibodies or one or more of the nucleic acid probes of the present invention and assaying for 
binding of the nucldc acid probes or antibodies to components within the test sample. 

Conditions for incubating anucleic acid probe or antibody with a test sample vary. 
Incubation conditions depend on the format employed in the assay, the detection methods 
employed, and the type and nature of the nucleic add probe or antibody used in the assay. One 

35 skilled in the art will recognize that any one of the commonly available hybridization. 
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amplification or immunological assay formats can readily be ad^ to employ the nucleic add 
probes or antibodies of the present invention. Examples of such assays can be found in Chard, 
T., An Inlioduction to Radioimmunoassay and Related Techniques, Elsevier Science Publishers, 
Amsterdam, The Netherlands (1986); Bullock, G.R. et al.. Techniques in Immunocytochemistry, 
5 Academic Press. Orlando, FL Vol. 1 (1982). Vol. 2 (1983), VoL 3 (1985); Tyssen, P., Practice 
and Tlieory of immunoassays: Laboratory Techniques in Biochemistry and Molecular Biology, 
Elsevier Science Publishers. Amsterdam. Ite Netherlands (1985). The test samples of the 
present invention include cells, protein or membrane extracts of cells, or biological fluids such as 
sputum, blood, serum, plasma, or urine. The test sarixple used m the above^lescribed method 
10 wiU vary based on the assay format, nature of the detection method and the tissues, cells or 
extracts used as the sample to be assayed. Methods for preparing protein extracts or membrane 
extracts of cells are well kno>vn in the art and can be readily be adapted in order to obtain a 
sample which is compatible with the system utilized. 

In another embodiment of the present invention, kits are provided which contain the 
15 necessaryreagentstocarryouttheassaysofthepresentinvention. SpecificaUy, the invention 
provides a compartment kit to receive, in dose confinement, one or more containers which 
comprises: (a) a first container comprising one of the probes or antibodies of the present 
invention; and (b) one or more other contamers comprising one or more of the following:, wash 
reagents, reagents capable of detecting presence of abound probe or antibody. 
20 indetaU, acompartmentkit includes any kit m which reagents are contained in separate 

containers. Such containers include small glass containers, plastic containers or strips of plastic 
or paper. Such containers allows one to efficiently transfer reagents &om one compartment to 
another compartment such that the samples and reagents are not cross-contaminated, and the 
agents or solutions of each container can be added in a quantitative fashion from one 
25 compartment to another. Such containers will include a container which will accept the test 
sample, a contamer which contains the antibodies used in the assay, containers which contain 
wash reagents (such as phosphate buffered saline, Tris-bufifers. etc.). and containers which 
contam the reagents used to detect the bound antibody or probe. Types of detection reagents 
mclude labeled nucleic acid Fobes. labeled secondary antibodies, or in the alternative, if the 
30 primary antibody is labeled, the enzymatic, or antibody binding reagents which are capable of 
reacting with the labeled antibody. One skilled in the art will readily recognize that the disclosed 
probes and antibodies of the present uivention can be readily incorporated into one of the 
established kit formats which are well known in the art. 



35 4.17 MEDICAL IMAGING 
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Ite novel polypeptides and binding partners of the invention are useful in medical 
imaging of sites expressing the molecules of the invention (e.g., where the polypeptide of the 
invention is involved in the immune response, for imaging sites of inflammation or infection). 
See, e.g., Kunkel et al., U.S. Pat NO. 5,413,778. Such methods involve chemical attachment of 
5 a labeling or imaging agent, administration of the labeled polypeptide to a subject in a 

pharmaceuticaUy acceptable carrier, and imaging flie labeled polypeptide in vivo at the target 
site. 



4.18 SCREENING ASSAYS 
10 Using the isolated protems and polynucleotides of the invention, the present invention 

further provides methods of obtaining and identifying agents which bind to a polypeptide 
encoded by an ORF corresponding to any of the nucleotide sequences set forth m SEQ ED N0 :1- 
1350, or bind to a specific domain of the polypeptide encoded by the nucleic acid. In detail, said 

method comprises the steps of: 
15 (a) contacting an agent vyith an isolated protein encoded by an ORF of tiie present 

invention, or nucleic add of the invention; and 

(b) deteimining whether the agent binds to said protein or said nucleic add. 

In general, therefore, such methods for identifying compounds that bind to a 
polynucleotide of the invention can comprise contacting a compound with a polynucleotide of 
20 the invention for a time sufBcient to fomi a polynucleotide/compound complex, and detecting 
the complex, so that if a polynucleotide/compound complex is detected, a compound tiiat binds 
to a polynucleotide of tiie invention is identified. 

Likewise, in general, tiierefore, such methods for identifying compounds that bind to a 
polypeptide of the invention can comprise contacting a compound witii a polypeptide of tiie 
25 invention for a time sufBcient to form a polypeptide/compound complex, and detecting tiie 
complex, so tiiat if a polypeptide/compound complex is detected, a compound that binds to a 
polynucleotide of the invention is identified. 

Metiiods for identifying compounds tiiat bind to a polypeptide of tiie invention can also 
comprise contacting a conq)Ound witii a polypeptide of tiie invention in a cell for a time 
30 sufficient to form a polypeptide/compound complex, wherein tiie complex drives expression of a 
receptor gene sequence in tiie ceU, and detecting tiie complex by detecting reporter gene 
sequence expression, so tiiat if a polypeptide/compound complex is detected, a compound tiiat 
binds a polypeptide of the invention is identified. 

Compounds identified via such mefliods can include compounds which modulate tiie 
35 activify of a polypeptide of tiie invention (tiiat is, increase or decrease its activity, relative to 
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activity observed in the absence of the compound). Alternatively, compounds identified via such 
methods can include compounds which modulate the expression of a polynucleotide of the 
invention (that is, increase or decrease expression relative to expression levels observed in the 
absence of fhc.compound). Compounds, such as compounds identified via the methods of the 
5 invention, can be tested uang standard assays vreUknovra to those of skill in the art for their 

ability to modulate activity/expression. 

The agents screened in the above assay can be, but are not limited to, peptides, 
carbohydrates, vitamin derivatives, or other pharmaceutical agents. The agents can be selected 
and screened at random or rationaUy selected or designed using protein modeling techniques. 
10 For random screening, agents such as peptides, carbohydrates, pharmaceutical agents and 

the like are selected at random and are assayed for tiieir abiUty to bind to (he protein encoded by 
the ORF of the present invention. Alternatively, agents may be rationally selected or designed. 
As used herein, an agent is said to be "rationaUy selected or designed" when the agent is chosen 
based on the configuration of the particular protein: For example, one skilled in the art can 

15 readily adapt currently available procedures to generate peptides, pharmaceutical agents and the 
like, capable of binding to a specific peptide sequence, in order to generate rationally designed 
antipeptide peptides, for example see Hurby et al., AppUcation of Synthetic Peptides: Antisense 
Peptides," In Synthetic Peptides, A User's Guide. W.H. Freeman, NY (1992). pp. 289-307. and 
Kaspczak et al.. Biochemistry 28:9230-8 (1989), or pharmaceutical agents, or the like. 

20 In addition to the foregoing, one class of agents of the present invention, as broadly 

described, can be used to control gene expression through binding to one of the ORFs or EMFs 
of the present invention. As described above, such agents can be randomly screened or 
rationally designed/selected. Targeting the ORF or EMF allows a skilled artisan to design 
sequence specific or element specific agents, modulating flie expression of either a single ORF or 

25 multiple ORFs which rely on the same EMF for expression control. One class of DNA binding 
agents are agents which contain base residues which hybridize or form a triple helix formation 
by binding to DNA or RNA. Such agents can be based on the classic phosphodiester, 
ribonucleic add backbone, or can be a variety of sulfliydryl or polymeric derivatives which have 
base attachment capacity. 

30 Agents suitable for use in these methods preferably contain 20 to 40 bases and are 

designed to be complementary to a region of the gene involved in transcription (triple helix - see 
Lee et al., Nucl. Acids Res. 6:3073 (1979); Cooney et al., Science 241:456 (1988); and Dervan et 
al.. Science 251:1360 (1991)) or to the mRNA itself (antisense - Okano. J. Neurochem. 56:560 
(1991); Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press. Boca 

35 Raton, FL (1988)). Triple helix-formation optimally results m a shut-off of RNA transcription 
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from DNA, while antisense RNA hybridization blocks translation of an mRNA molecule into 
polypeptide. Both techniques have been demonstrated to be effective m model systems. 
Infoimation contained in the sequences of the present invention is necessary for the design of an 
antisense or triple helix oligonucleotide and other DNA binding agents. 
5 Agents which bind to a protein encoded by one of the ORFs of the present invention can 

be used as a diagnostic agent Agents which bind to a protein encoded by one of the ORFs of the 
present invention can be formulated using known techniques to generate a pharmaceutical 
composition. 

10 4.19 USE OF NUCLEIC ACroS AS PROBES 

Another aspect of the subject invention is to provide for polypeptide-specific nucleic acid 
hybridizatipn probes capable of hybridizing with naturally occurring nucleotide sequences. The 
hybridization probes of the subject invention may be derived from any of the nucleotide 
sequences SEQ ID NO:1-1350. Because the corresponding gene is only expressed in a lunited 

15 number of tissues, a hybridization probe derived from of any of the nucleotide sequences SEQ 
ID NO:1-1350 can be used as an indicator of tiie presence of RNA of cell type of such a tissue in 
a sample. 

Any suitable hybridization technique can be employed, such as, for example, in situ 
hybridization. PGR as described m US Patents Nos. 4,683,195 and 4.965,188 provides 

20 additional uses for oUgonucleotides based upon the nucleotide sequences. Such probes used in 
PGR may be of recombinant origin, may be chemicaUy synthesized, or a mixture of both. The 
probe wiU comprise a discrete nucleotide sequence for the detection of identical sequences or a 
degenerate pool of possible sequences for identification of closely related genomic sequences. 
Other means for producing specific hybridization probes for nucleic acids include the 

25 cloning of nucleic acid sequences into vectors for the production of mRNA probes. Such vectors 
are known in the art and are commercially available and may be used to synthesize RNA probes 
in vitro by means of the addition of the appropriate RNA polymerase as T7 or SP6 RNA 
polymerase and the appropriate radioactively labeled nucleotides. The nucleotide sequences may 
be used to construct hybridization probes for mapping thefr respective genomic sequences. The 

30 nucleotide sequence provided herein may be mapped to a chromosome or specific regioiis of a 
chromosome using well known genetic and/or chromosomal mapping techniques. These 
techniques include in situ hybridization, Unkage analysis against known chromosomal markers, 
hybridization screening with libraries or flow-sorted chromosomal preparations specific to 
known chromosomes, and the like. The technique of fluorescent in situ hybridization of 
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chromosome spreads has been described, among other places, in Venna et al (1 988) Human 
Chromosomes: A Manual of Basic Techniques, Pergamon Press, New YorkNY. 

Huorescent in situ hybridization of chromosomal preparations and other physical 
chromosome mapping techniques may be correlated with additional genetic map data. Examples 
5 ofgeneticmapdatacanbefoundinthel994GenomeIssueofScience(265:1981Q. Correlation 
between the location of a nucleic acid on a physical chromosomal map and a specific disease (or 
predisposition to a specific disease) may help delimit the region of DNA associated with that 
genetic disease. The nucleotide sequences of Ihe subject invention may be used to detect 
differences in gene sequences between normal, carrier or affected individuals. 

10 420 PREPARATION OF SUPPORT BOUND OLIGONUCLEOTIDES 

OUgonucleotides,/.e., small nucleic acid segments, may be readily pieparedby, for 
example, directly synthesizing the oligonucleotideby chemical means, as is commonly practiced 

using an automated oligonucleotide synthesize. 

Support bound oUgonucleotides may be prepared by any of the methods known to those of 

15 skillin1heartusinganysuilablesupportsuchasglass,polystyreneor Teflon. One strategy is to 

precisely spotoUgonucleotidessynthesizedby standard synthesizers, hnmobilizationcanbe 
achievedusingpassiveadsotption(Inouye&Hondo,(1990)J. Clin. Microbiol.28(6) 1469-72); 

using UV light (Nagalaefai, 1985;Dahlene^a/., 1987;Morrissey& Collins, (1989) Mol. Cell 
Probes 3(2) 189-207) or by covalent binding ofbase modified DNA (KeUere< a/., 1988; 1989); all 

20 references being specifically incorporated herein. 

Another strategy that may be employed is the use of the strong biotin-streptavidin 
interactionas a linker. For example. Broude.M/. (1994)Proc.Nati. Acad. ScL USA 91(8) 3072-6, 
describe the useofbiotinylatedprobes,althoughthese are duplex probes, that are immobilized on 
stieptavidin-coatedmagneticbeads. Streptavidin-coated beads may be purchasedfromDynal. 

25 Oslo. Ofcourse,thissamelinkingchemistryisapplicabletocoatinganysurfecewithstreptavidi^ 
Biotinylated probes may be purchased fiom various sources, such as, e.g., Operon Technologies 
(Alameda, CA). , 

Nunc Laboratories (NapervUle,IL)is also selling suitablematerial that could be used. Nunc 
LaboratorieshavedevelopedamethodbywhichDNAcanbecovalentiyboundtolhemicrowell 
30 surfacetermedCovalinkNH. CovaLinkNH is a polystyrene smface grafted with secondary amino 
groups(>NH) tiiat serve as bridge-headsforfinthercovalent coupling. CovaLmkModdesnmy be 
purchasedfromNunc Laboratories. DNAmoleculesmay be bound to CovaLmkexdusivelyat the 
5'-endby aphosphoramidatebond,allowingimmobilizationof morethan 1 pmolofDNA 
(Rasmussen«/fl/., (1 991) Anal. Biochem. 198(1) 138-42). 
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The use of CovaLinkNH strips for covalent binding of DNA molecules at Ihe S'-end has 
been described (Rasmussenet al., (1991). In fliis technology, a phosphoramidatebond is employed 
(Chuetal., (1983)NuclMcAcidsRes. 11(8) 6513-29). Ihis is beneficial as immobifizationusing 
only a single covalent bond is preferred. The phosphoramidate bond joins the DNA to the 

5 CovaLmkNH secondary amino groups that are positioned at the end of spacer arms covalently 
graftedontothepolystyrenesurfecethrougha2 nmlongspacerarm. ToKnkanoUgonucleotideto 
CovaLinkNH via an phosphoramidatebond, the oligonucleotidetenninus musthave a 5'-end 
phosphate group. It is, perhaps, evenpossiblefor biotm to be covalently bound to Covalinkand 
ttoistieptandinusedto bind the probes. 

10 More spedfically.the linkage method includes dissolvingDNA in water (7.5 ng/ul) and 

denaturii^for 10 min. at 95''C and cooling on ice for 10 mm. Ice-cold 0.1 M 1-mefliyOniidazoIe, 
pH 7.0 (l-Mehn?), is then added to a final concentrationof 10 mM 1-Mehn7. A ss DNA solutionis 
then diqwnsed into CovaLinkNH strips (75 ul/well) standii^ on ice. 

Carbodiimide 0.2 M l.ethyl-3-(3-dimediylaminopK)pyl)-carbo<Uimide (EDC), dissolved in 

15 l0mMl-MeIm7,ismadefieshand25uladdedperweU. The strips are incubated for 5 hours at 
50°C. Aflermcubationthestripsare vmhedusmg,e.g.,Nunc-Immuno Wasl^fi^st1hevfells are 
wadied 3 times, thai they are soaked with washing solution for 5 min., and finally they are washed 
3 times (where in the vrashing solution is 0.4 N NaOH, 025% SDS heated to 50°C). 

It is contemplatedthat a further suitable mefliod for use with the present invention is that 

20 describedin PCT Patent AppUcation WO 90/03382 (Soulbem& Maskos),mcorporated herein by 
reference. This method of preparing an oUgonucleotidebound to a support involves attaching a 
nucleoside 3'-reagent through the phosphate group by a covalent phosphodiester link to aliphatic 
hydroxyl groups carriedby flie siqjport. The oligonucleotideis then synthesized on the supported 
nucleoside and protecting groups removed fiom the synthetic ohgonucleotide chain under standard 

25 conditionsthat do not cleave the oHgonuoleotidefiom the support. Suitable reagents include 
nucleoside phosphoramidite and nucleoside hydrogen phosphorate. 

An on-chip strategy for the preparationof DNA probe for tixe preparation of DNA probe 
arrays may be employed. For exsaaplt, addressable laser-activated photodeprotectionmay be 
employed in the chemical qoithesis of oligonucleotides duwtly on a glass surface, as described by 

30 Fodorc/o/. (1991)Science251(4995)767-73,incorporatedhereinbyreference. Probesmayalso 
be immobilized on nylon supports as describedby Van Ness etal. (1991)Nucleic Acids Res. 
19(12) 3345-50; or linked to Teflon using the method of Duncan & Cavalier (1988) Anal. Biochem. 
169(1) 104-8; all references being q)ecificallymcoiporatedherein. 
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To link an oligomicleotideto a nylon support, as described by VanNess et aL (1991), 
requires acdvalionofflie nylon surfeoe via alkyMon and selective activation of the 5'-amine of 

oligonucleolideswilii cyanuric chloride. 

Oneparticiilarvyay topreparesupportboundoligonucleotidesisto utilizethe 

5 Ught-generatedsynthesisdescribedbyPeaseera/., (1994) PNAS USA 91(11) 5Q22-6,mcoiporated 
hereinby reference). These authois used currentphotolithographictechniquesto generate arrays of 
in3mobUizedoUgonucleotideprobes(DNAchips).TTiesemethods,inwMchlightisusedtod^ 
thesynthesisofoUgonucleotideprobesinWEh-deiisity,minia1urizfidaiiays,ut^ 
5'.ptotected JV-acyl-deoxynucleosidephosphoranudites, sutfece linker chemistry and versatile 

10 combinatorialsynthesisstrategies. A matrix of 256 spatially defined oligonucleotideprobes may be 
generated in this manner. 

4.21 PREPARATION OF NUCLEIC ACID FRAGMENTS 

The nucleic acids may be obtained from any appropriate source, such as cDNAs, genomic 

DNA, chromosomalDNA, microdissected chromosome bands, cosmid or YAC inserts, and RNA, 
15 mcludingmRNAwithoutanyamplificationsteps. For example, Sambrookc/ a/. (1989) describes 

three protocols for the isolation of high molecular weight DNA from mammaUan cells (p. 

9.14-9.23). 

DNA fragments may be prepared as clones m Ml 3, plasmid or lambda vectors and/or 
prepared directly from genomic DNA or cDNA by PGR or other amplificationmethods. Samples 

20 may be prepared or dispensed mmultiwell plates. About 100-1000 ng of DNA samplesmay be 
prepared m 2-500 ml of final volume. 

The nucleic acids would thenbe fragmentedby any of the methods known to those of skill 
mthe artincludmg,for example, using restriction enzymes as described at 924-9.28 of Sambrooke^ 
al. (1 989), shearing by ultrasound and NaOH treatment. 

25 Lowpressureshearingis also appropriate, as described by Schriefer ef al (1990)Nucleic 

Acids Res. 18(24) 7455-6, incoiporatedherein by reference). In this method, DNA samples are 
passed through a smaU French pressure ceU at a variety of low to intennediate pressures. A lev« 
deviceallowscontroUedapplicationof lowto mtermediate pressures to the cell. Theresultsof 
these studies indicate that low-pressure shearingis ause&l alternative to sonic and en2ymatic DNA 

30 fragmentationmethods. 

One particularly suitable way for fragmenting DNA is contemplated to be that using the two 
. baserecogmtionendonucIease,Cv,JI,describedbyFitzgeralderfli (1992) Nucleic Acids Res. 
20(14) 3753-62. These authors described an approach for the rapid fragmentation and fractionation 
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of DNAinto particular sizes that Ihey contemplated to be suitable for shotgun cloning and 
sequencing. 

The restriction endonuclease CviJI normaUy cleaves Ihe recognition sequence PuGCPy 
bctweenihe G and C to leaveblunt ends. Atypical reaction conditions, which alter the specifidty of 

5 this enzyme (Cvfll**), yield a quasi-random distribution of DNA fiagments form the small 
moleculepUC19 (2688 base pairs). Fitzgerald etaL (1992) quantitativelyevaluatedlhe 
randomness of this fragmentationstrategy.using a CvOl** digest of pUC19 that was size 
fractionatedby arapidgel filtration method and directly ligated, without end repair, to a lac Z minus 
M13 cloning vector. Sequence analysis of 76 clones ^owedthat CvfJI** restrictspyGCPy and 

10 PuGCPu. in addition to PuGCPy sites, and that new sequence data is accumulated at a rale 

consistent with random fiagmentation. 

As reportedin the Uterature,advantages of this approach comparedtosonication and 

agarose gel fractionationinclude: smaller amounts of DNA are required (02-0.5 ug instead of 2-5 

ug); and fewer steps are involved (no pieligalion, end repair, chemical extraction, or agarose gd 

15 electrophoresisanddvrtion are needed 

IiTespectiveofthemannerinwhichthenucleicacidfragmentsareobtainedorprepared,itis 

imporlantto denatorethe DNA to give single stranded pieces avaUable for hybridization. This is 
achievedbyincubatingtheDNAsohitionfor2-5minutesat80-90°C. The solution is then cooled 
quickly to 2-0 to prevent renaturationofthe DNA fiagments before they are contacted with the 
20 chip. Phoq)hategroupsmustalsoberemovedfix)mgenomicDNAbymethodsknownintheart, 

422 PREPARATION OF DNA ARRAYS 

Arrays may be prepared by spotting DNA samples on a support such as a nylon membrane. 
Spotting may be performed by using arrays of metal pins (the positions of which correspond to an 
array of wells in a microliter plate) to repeated by transfer of about 20 nl of a DNA solution to a 

25 nylonmembrane. By ofiset printing, a density of dots higherthanthe density of the wells is 

achieved. One to 25 dots may be accommodatedin 1 ram^ depending on the type of label used. By 
avoiding spotting in some preselectednumber of rows and columns, separate subsets (subanays) 
may be formed. Samples in one subarray may be the same genomic segment of DNA (or the same 
gene) from different individuals, or maybe different, overlapped genomic denes. Each of the 

30 subarraysmay represent repUca spotting of the same samples, hi one example, a selected gene 

segment may be amplified from 64 patients. For each patient, the amplified gene segmentmay be in 
one 96-weU plate (aU 96 weUs containingthe same sample). A plate for each of the 64 patients is 
prepared. By using a 96-pin device, all samples may be spotted on one 8 x 12 cm membrane. 
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Sufeanaysmaycontab 64 samples, one fiomeachpati^Whac the 96 subanaysa^ 
dot span may be 1 mm^ and there may be a 1 nm space between subairays. 

Another approachis to use membianesorplates(avanablefo)mi™C.Napervme,m^^ 
which may be partilionedby physical spacersfc^: aplastic gridmoldedoverthemembrane,1he grid 
5 being similar to the sortof membrane appUed to the bottom of muWweU plates, or hydrophobic 
strips. A fixedphysical spacer is not preferred for imagmgby exposuieto flat phosphor-storage 
screens or x-ray films. 

niepresentinventionis ilhistratedinthe foUowing examples. Upon considerationof the 
presentdisdosuTe.oneofsldUin1heartwiUappreciatethatmanyotherembodimentsandY^ 
10 maybemadcinthescopeoflhepresentinventioa Accordingly, it is intended that the broader 
aspectsofthepresentinventionnotbe limited to the disclosure of the foUowing examples. The 
presentinventionisnottobelimitedmscopebytheexempMedembodhnentsv*ic^ 
as iUustrationsof single aspects of the invention, and compositions and methods which are 
functionallyeqnivalentarewithinthescopeoftheinvention. Indeed, numerous modificadons and 
15 variationsinthepracticeofthemventionareexpededtooccurtothoseskiUedintheartu^^ 
. considerationofthepresentpreferredembodiments. Consequently.the only Hmitations which 
should be placedupon the scope ofthe invention are those which appear in the ^>pended claims. 
AU referencescited within the body of the instant specification are hereby incorporatedby 

reference m their entirely. 
20 5.0 EXAMPLES 

5.1 EXAMPLE 1 

TJnvpl Nnrlric Acid Se f| i»^nces Qbtai n««i Vmm Various Libraries 
A plurality of novel nucleic acids were obtained &om cDNA libraries prepared 6am various 
humantissuesandinsomecaswisolatedfi-omagenomicUbraryderivedfromhumanchromosome 

25 using standaidPCR. SBH sequence signature analysis and Sanger sequencing techniques, llie 
insertsoftheUbrary were amplified withPCRusingprimers specific for the vector sequences 

which flankthe inserts. Clones fiomcDNAUbraries were spotted on nylon membrane filters and 
screened with oligonucleotideprobes(e.g.,7-mers)to obtain signature sequences. TTiedones 
clusteredinio groupsofsunilar or identical sequences. Representativeclones were selected^ 

30 sequencing. 

In some cases, the 5' sequence of the amplified mserts vm then deducedusing a typical 
Sangersequencmgprotocol. PCRproductsv^rerepurifiedandsulgected to fluorescent dye 
teiminatorcycle sequencing. Singlepass gel sequencing was done usinga377AppUedBiosystems 
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(ABl) sequekcerto obtain the novel nucleic add sequences. In some cases RACE (Random 
Amplificationof cDNAEnds) was performedto further extend the sequenc&inihe 5' directioa 



5^2 EXAMPLE! 
5 Novel Contigs 

The novel contigs of the invention were assembled from sequences that were obtained from 
a cDNA library by methods described in Example 1 above, and in some cases sequences obtained 
from one or more public databases. The sequences for the resulting nucleic acid contigs are 
designated as SEQ ID NO: l-l 350 and are provided in the attached Sequence Listing. The contigs 

10 wereassembledusinganESTsequenceasaseed. Then a recursive algorithm was used to extend 
the seed EST into an extended assemblage, by pulling additional sequences from different databases 
(Le., Hyseq's database containing EST sequences, dbEST version 1 14, gb pri 1 14, and UniGene 
version 101) that belong to this assemblage. Ibe algorithm terminated when there was no 
additional sequences from the above databases that would extend the assemblage. Inclusion of 

15 , componentsequencesinto the assemblage was based on a BLASTNhit to the extending 
assemblage with BLAST score greater than 300 and percent identity greater than 95%. 

Table 3 sets forth the novel predicted polypeptides (including proteins) encoded by the 
novel polynucleotides (SEQ ID NO: 1 89-282) of the present invention, and their corresponding 
nucleotidelocationstoeachofSEQIDNO: 189-282. Table 3 also indicates the method by which 

20 the polypeptide was predicted. Method A refers to a polypeptide obtained by using a software 
program called FASTY (available from http://fasta.bioch.virginia.edu) which selects a polypeptide 
based on a comparison of the translated novel polynucleotide to known polynucleotides (W JL 
Pearson, Methods in Enzymology. 183:63-98 (1990), herein incorporated by reference). MethodB 
refers to a polypeptide obtained by using a software program called GenScan for human/vettebtate 

25 sequences(availablefromStanfordUnivereity,OfiiceofTechnologyLicensing)thatpredictsthe 
polypeptide based on a probabihstic model of gene structure/compositionalproperties (C. Burge 
and S. Karlin, J. Mol. Biol., 268:78-94 (1997), incorporated herein by reference). Method C refers 
to a polypeptide obtained by using a Hyseq proprietary software program that translatesthe novel 
polynucleotide and its complementary strand into six possible amino acid sequences (forward and 

30 reverse frames) and chooses the polypeptide with the longest open reading frame. 

The nearest neighbor results for SEQ ID NO: 1-1350 were obtauied by a BLASTP 
■ version 2.0al 19MP-WashU search against Genpept release 120 and Geneseq database October 
12, 2000, update 21 (Derwent). using BLAST algorithm. The nearest neighbor result showed the 
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Closest homologue for SEQ ID NO:1-1350. The nearest neighbor results foi SEQ ID NO: 1- 

1 350 are shown in Table 2 bdow. 

Tables 1, 2 and 3 follow. Tablel showsthevaiio\istissuesourcesof SEQIDNO: 1-1350. 
Table2 showsthenearestneighborresultforthe assembled contig. The nearestneighbor result 
shows the closest homolog with an identifiablefimction for each assembIage.Table3conto^ 
stMand stop nucleotides for the translated amino acid sequencefor which each assemblage 
encodes. Table 3 alsoprovidesa correlationbetweenthe amino acid sequences set forthin the 
Sequence Listing, the nucleotide sequences set forth in the Sequence Listing and the SEQ ID NO. h 
USSN 09/496,914. 



98 



wo 01/57188 



PCTAJSOl/03800 



TABLE 1 



Tissue Origin 


RNA Source 


Hyseq Library Name 


SEQIDNOS: 


adult bsrain 


GIBCO 


AB3001 


ill 151 188215 662-665 877 910 927 
976 1233 1319 


adult brain 


GIBCO 


ABD003 


41 49 74 101 111 120 132 141-142 151 
217 225 238 271 317 404 446 469 503 
513-514 535 550 564 573 666-669 798 
898 910 927 976 1067 1083 1085 1178 
1254 


adult brain 


Clontecb 


ABROOl 


39 216 238 327 356 535 927 1056 1 121 
1178-11801199 1251 


adult brain 


Clontech 


ABR006 


74 611 949 10341136 


adult brain 


Clontecb 


ABR008 
• 


14 32 41 61 81 86 89 120 132 138 145 
147 188 197 208 225 227-239 250 300- 
303 312 316 328-331 340 357-362 374 
380 384-391 408 414 446 448 464-467 
483 488 495-496 505 512 521 535 550 
566 571 577 585 590 594 598 634 641 
658 666 683 725 742 764 767 786 801 
805 810 823 826 829 831 836 841 887- 
923 927 934 943 950-951 963 976 995 
1000-1001 1006 1026 10341048 1057- 
1067 1086 1088 1090 1118 1120 1122- 
1128 1142 1162 1181-1192 1199 1204 
1218-1219 1225 1232 1253 1267 1271- 
1306 1342.13471349-1350 


adult brain 


Clontech 


ABROll 


49 238 1219 


adult brain 


BioChain 


ABR012 


74238 


adult brain 


Invitrogen 


ABR013 


868 1268 


adult brain 


Ihvitrogen 


ABT004 


49 1 17 138 191 217 252 291 305 535 
566 596 663 670 746 798 816-819 876 
892 898 922 943 963 1034-1036 1 121 


cultured 
preadipocytes 


Strategene 


ADPOOl 


41 74 101 138 21 1 238 304 537 582 
740798 883 943 976 1067 


adrenal gland 


Clontecb 


ADR002 


"4974 101 111 120127 151215 238 
240-247 316 330 363-364 404 414 534- 
535 833 924-940 950 963 976 1001 
1003 1067-1070 1118 1156 1193-1200 

1325 


adult heart 


GBCO 


AHROOl 


38 49 71-7274-77 79 92 99101 111 
118 129 132 138 151 158-163 182 195- 
203 215 217 238 264 269 353 384 398 
408 434-439 446 504 512-513 519 537 
562-573 577 611-614 616-619 658 661 
671-672 722 734 757-773 815 52o-oiJ 
874 891 898 919 926-927 976 988 
1021 1037 1041 1062 1067 1071 1080 
1083 1093 1122 1131 llo5 1201 1^34 
1308 1331 1335 


adult kidney 


GIBCO 


AKDOOl 


41 49 51 71-74 78-85 94 100-101 103- 
107 111 119-120 138 151 157215217- 
218 238 250 264 294 304 384 404 440 
446454 477 504-505 509 514 518-519 
535 537 564 574-583 620-627 639 653 
673-675 705 753 789 831 844 851 859 
877 909 918 927 956 963 976 1067 
1074 1083 1095 1178 1302 1331 1335 


adult kidney 


Invitrogen 


AKT002 


11-12 41 49111-112215-217294 316 
446 487 564 575 844 868 910 927 976 
1116 


adult lung 


GIBCO 


ALGOOl 


8101 111 151 187 402 446 490 514 
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KNA Source 


Hyseq Library Name 


SEQIDNOS: 

518 537 545 549 580 582 592 594634 
640 651-652 676-678 725 851 873 ,18 
952 976 1042 1067 1076 1083 1152 


lymph node 


Clontech 


ALNOOl 


0 in 1^1 1^1 180-187 188 715 557 
545 549 651 679-682789 804-810 868 
873 927 952 976 1042 1059 1335 


young liver 


GIBCO 


ALVOOl 


8 64 79 11 1 186 215-216238 446 514 
519 537 564 653 683-684 698 753 798 

813 833 840 858 927 976 1038-1039 
1051 1085 1224 1245 1256 


adult liver 


Invitrogen 


ALV002 
ALV003 


40 71 292-293 305 384 468-469 496 
505 657 675 714 753 832 844 941-942 
976 1040 1076 1256 1293 
976 


adult liver 
adult ovary 


Clontech 
Invitrogen 


AOVOOl 


8 32 35 3o 41 4y Dl M i** ly-oU lui 
104 111 120 122-125 138 140 143-149 
151 188-190207-212215-217238 264 
3 1 6 384 409 440 445-446 496 504 5 12 
514 518-519 535 537 549-550 564 566 
571 580 582 600 618 638 657 667 681 
685-697 699 705 722 735-744 761 771 
815 833 842-865 868 875-876 918 926- 
927 950 952 963 976 1023 1042 1048 
1051 1059 1072 1076 1083 1117 1120 
1 124 1 13 1 1 144 1 174 1224 1268 133 1 


adult placenta 


Clontech 


APT om 


1335 

102 217 238 537641700 


placenta 
adult spleen 


Invitrogen 
GBCO 


APL002 
ASPOOl 


663 851 1048 

8 45 74 1 1 1 132 140 151 185 217 238 
294 414 446 477 504 5 14 534 545 549 
592 722 873 883 952 976 1041-1042 
1083 1093-10941152 1224 


testis 


GIBCO 


ATSOOl 


72 107 111 113 126 140151 183 215 
238 446 497 537 642 701-706 811 877 
927 962 976 1083 11171131 


adult bladder 


Invitrogen 


15L1/UU1 


41 151 191 402-405409 414 496 545 
592 607 706 873 952 1178 1329-1335 


bone maiTOw 


Clontech 


BMDOOl 


8 58-62 65-68 74 79 108 111 1 16 137 
147 151 164-174 213-215 238 305-307 
374 404 446 460 466 516 519 534 538- 
541 544-546 549-554 566 584 586 592 
596 607 610 628-629 643^5 652 707- 
708 774-789 844 866-871 873 919 927 
952 963 976 998 1034 1042 1064 1083 
1085 1120 1132 1152 1225 1229 1268 
1307 1310 


bone mairow 


Clontech 


BMD002 


6 8 37-38 52 74 77 105 1 1 1 129 132 
210 317 510-51 1 545 549 581 598 628 
638 724 766 789 844 860 868 873 919 
927952 963 968976 1042 1111 1141 
1160-1161 1229 1266 1346 


bone marrow 


Clontech 


BMD004 


111 238 282 549 1083 


adult colon 


Invitrogen 


CLNOOl 


' 52 260 264 299494 536 545 564 592 
844 873 877 952 976 1042 1152 1268 
1336-1337 


adult cervix 
diaphragm 


BioChain 
BioChain 


CVXOOl 
DIA002 


" 4951 129 132 151205207238332- 
335 365-367 392-401 440 466 470-471 
5 1 8 537 597 629 832 877 927 976 1006 
1085 1117 1129-11341192 1202-1205 
1219 1309-1328 
74976 1083 
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Tissue Origin 
endothelial cells 


RNA Source 
Strategene 


Hyseq Library Name T 
EDTOOl 


SEQroNOS: 

32 40-41 49 74 79 101 1 1 1 120 132 
138 151204-206215-217238269 316 
414 433 505 510 513 550 555 580 582 
596 675 722 745 798 814 836-841 851 
918 976 1041 1043 1073 1083 1131 
1331 


Genomic clones 
from the short arm 
of chromosome 8 


Genomic DNA 
from Genetic 
Research 


EPMOOl 


525-532 927 


Genomic clones 
from the short arm 
of chromosome 8 


Genomic DNA 
from Genetic 
Research 


JEPMOOS 


47525 


Genomic clones 
from the short arm 
of chromosome 8 


Genomic DNA 
from Genetic 
Researdi 


EPM004 


525927 


Genomic clones 
from the short arm 
oi cnromosome o 
esophagus 


Genomic DNA 
from Genetic 

PacAflrch 

BioCham 


EPM005 

ESO002 


531 

74 138 238 
441-442 927 


fetal brain 
fetal brain 
fetal brain 


Clontech 
Clontech 
Lriontecn 


FBROOl 
FBR004 
FBR006 


215 893 927 1001 

48 61 101 120132 138 140 147208 
225 271 317 319 336 359 368 405-414 
519 550 571 594 686 715 722 764 824 
829 836 859 909 927 943 947 963 1057 
1067-1068 1104 1135-1140 1162 1206- 
1207 1235 1268 1288 1307-1308 1319 
1338-1350 
111446 


fetal brain 
fetal brain 


Clontech 
Invitrogen 


FBRs03 
FBT002 


Ai ^1 190 ISI 192-194 264 504512 
535 683 761 798 820-827 844 876 909 
963 976 1026 1048 1083 1144 1302 
446 566761 


fetal heart 
fetal kidney 

fetal kidney 


Invitrogen 
Clontech 

Clontech 


FHROOl 
FKDOOl 

FKD002 
FKD007 


"51 74 111 127 140 151 184 294 537 
550 630-631 1319 
1119761083 
238 974 


III 


Invitrogen 

Clontech 

Invitrogen 


FLGOOl 
FLG003 


463 566976 1074 1083 1093 

41 238 330 407 415-416 537 573 844 

859 1048 1083 11161192 


fetal liver-^leen 


Columbia 
University 


FLSOOl 


8 14 34-35 37 41 43 49 51 54-56 63-64 
69-71 74 77 79 87-90 101 107 110-11 1 
114 120 128-131 138 140 147 150-155 
197 210 215 217 225 238 3 12 367 384 
414 440 446 460 468 483 496 504-507 

544-545 547-550 555-560 564 566 571 
577 582 585-586 598 636 646-647 649 
652 664 698 709-710 714 722-723 73 1 
735-736746-753 761 784798 823 829 
832 844 851 858-859 868 873 876 898 
927 943 949 952 963 976 984 1002 
1021 1023 1040 1042 1044 1050 1083 
1093 1116 1120 1129 1I3I 1144 1174 
1217 1251 1254 1256 1302 1308 1311 
1319 


fetal liver-spleen 


Columbia 
University 


FLS002 


' ■ 8 36-37 41-4649 54 64 71 74 79 101 
111 120 129 147207 210215-216238 
250 330 353 359 366 383-384 414 478 
505 508-509 51 1 515-524 534-535 537 
544-545 564 566 571 577 591 598 638 
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Tissue Origin 


RNA Source 


Hyseq Library Name 


SEOIDNOS: 

663 671 69S 714 722 725 727 751 /vo 
851 859 873 876 909 927 949 952 983- 
984 1002 1023 1042-1044 1085 1095 
1131 1144 1178 1199 1233 1240-1270 
1331 1340 


fetal liver-spleen 


Columbia 
University 


FLS003 


64 535 976 1256 


fetal liver 


Invitrogen 


FLVOOl 
FLV004 


"8 101 120 138 217 446 468 535 566 
580 722 730 749 844 918 943 976 1051 
1256 1331 
537 926 1256 


fetal liver 

fetal muscle 


Clontech 
Invitrogen 


FMSOOi 
FMS002 


51 1 1 1 264 312 369-370 404 417-421 
425 535 537 577 598 614 836 857 1 141 
1208 1268 
537 


fetal muscle 

fetal skin 

fetal skin 


Invitrogen 
Invitrogen 

Invitrogen 


FSKOOl 
FSK002 


13-26 32 41 51 89 107 111 147 151 
225 264 316 405 422-429 488-494 496 
<io s-^^ ^"^7 Sfi6 675 732 859 876- 
877 898 947 949-950 963 976 1001 • 
1062 1076 1083 1117 1144 1165 1268 
1281 
537 812 


fetal spleen 
umbilical cord 


BioChain 
BioChain 


FSPOOl 
FUCOOl 


87 549 

"27-33 4149 151215 238 248-249 301 
316 446 495-503 519 521 534-535 537 
582 634 691 877 883 927 944-950 963 
976 1001 1075 1142-1143 1171 1218 
1243 1308 


fetal brain 


GIBCO 


HFBOOl 
HMPOOl 


41 49 5779 87 103 111 120 132-135 

138 145 151 188 197207 215 238264 

271 294 316 367 414 440 446 466 504 

513-514 535 542-543 550 564 571 596 

635 648-654 675 711-715 722-723 798 

832 872 876 883 927 976 1095 1 144 

1 1 <;« 1 171 1 17R 191 1 1335 
1 lOo ll/i ii/o iibii ijjj 

238 


macrophage 

infant brain . 


Invitrogen 
Columbia 
University 


IB2002 


"49-50 77 81 89 105 111 136-138 140 
1^1 17^170 185 216-217264 295 
299 308-310 371-373 462 476 504 51 1- 
5^'? 537 564 566 571 655-657 662 
683 716-720 723 752 790-803 829 832 
858-859 876 898 909 949 976 1045- 
1047 1076-1087 1090 1093 1116 1122 
1144 1209-1213 1225 1233 1256 1319 
1341 


infent brain 


Columbia 
University 


IB2003 


41 50 77 104 132 215 238 508 512-513 
519 566 655 714 794 918 943 976 1067 
1092-1093 1233 


infant brain 


Columbia 
University 


IBM002 


311472-473 753 1214 


UliOUl UXalU. 


Columbia 
University 


IBSOOl 


■■ 5 1 1 1 1 376 474 790 876 949 1 144 1204 
1221 

■ ""151 316 462 5 14 534 582 675 93y 1 131 


lung , fibroblast 
lung tumor 


Strategene 
Invitrogen 


LFBOOl 
LGT002 


■ 1-7 41747994115120138-13^150 
215 217 269 280 296 337 374-375 384 
404 446 454 475-480 498 514518-519 
522 537 545 564 577 597 653 658 705 
721-724 754-756 779 859 868 872-874 
876-877 919 927 949 951-952 959 976 
1002 1042 1048-1053 1076 1083 1088- 
1089 1131 1144-1147 1216-1218 1229 
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Tissue Origin 
lymphocytes 


KNA Source 
ATCC 


Jyseq Library Name 
LPCOOl 


SEQ ID NOS: 
1293 1311 

4174 111 132 151253 316446 550 
634 844 927 976 1085 1268 


leukocyte 


GIBCO 


LUCOOl 


8 11417486 91-98 101 109111 120 
147 151 212 215 218 238 252 288 312- 
3 14316 338 359 408 427 443-447 505 
510 512 514 518 534 545 549-550561 
564 566 571 577 580 582 587-609 615 
632-638 658-659 698 714 725-728 832 
836 841 859 866 873-874 882-883 918- 
919 927 943 952 963 976 1042 1076 
1083 10901148 1152 11681195 1219- 
1220 1224 


leukocyte 


Clontedi 


LUC003 


74 100 215 232 238 339-341 446 545 
657 660 729 873 883 927 952 963 1008 
1042 1116 1120 1149-1150 1215 1222 


Melanoma from cell 

lineATCC^CRL 

1424 


Clontech 


MEL004 


o 1 m 1 < Olfi S45 592 722 873 

919 929 939 952976 1071 1118 1218 

1235 1243 


mammary gland 


Invitrogen 


MMGOOl 


8-10 40-41 49 73 80 114 13U-14U 14/ 
217 250-256 264 297-299 305 377-378 
398 446 481-486 505 512 537 545 549 
571 592 725 730-733 816 829 836 844 
868 873 876-877 898 926 943 951-960 
963 976 995 1034 1042 1048 1054- 
1055 1076 1083 1091 1093 1116-1117 
11241152 1302 


bduced neuron cells 


Strategene 


NTDOOl 


39 101 111 138238 361 1225 1251 

1319 


retinoid acid induced 

neuronal cells 

neuronal cells 


Strategene 

Strategene 


NTKUUl 

NTUOOl 
PIT004 


74 225 976 

129 225 238 304 313 361 657 976 
976 


pituitaiy gland 
prostate . 


Clontech 
Clontech 
Clontech 


PLA003 
PRTOOl 


38 976 

1 1 1 188 238 257-258 564 724 961-966 
1067 1095 


rectum 


Invitrogen 


RECOOl 


238 430-431 841 859 868 963 1001 
1116 


cck1{\rsii*t/ olnnfi 
Sallvaijr giBUiu 


Clontech 


SALOOl 


8 151 402 432-433 446 496 868 95i! 
976 1083 1120 1151 1184 


small intestine 


Clontech 


SINOOl 


■ 8 101 147 215 259-266 446 462 505 
545 592 660 789 836 866 873 927 952 
963 967-978 1042 1120 1152 1223- 








1224 

238 302 927 943 992 1031 


skeletal muscle 
spinal cord 


Clontech 
Clontech 


SKMOOI 
SPCOOl 


74 111 132 151 215-216 235 264 2G> 
270 343-344 353 379 516 537 566 740 
828 927 976 979-994 1092 1153-1159 








1225 1250 
698 859 1042 


adult spleen 

ctnmach 


Clontech 
Clontech 


SPLcOl 
STOOOl 


210 238 271-272 537 580 705 918 952 
995 1171 


thalamus 


Clontech 


raA002 


61 219-220273-276 312 315 330 596 
963 996-1007 1059 1093 1160-1162 


thymus 


Clonetech 


THMOOl 


8 120 151 208 221 316-317353 Wii 
750 867 874 878-881 927 963 1023 
1083 1094-10961124 


thymus 


Clontech 


THMc02 


■ "8 61 114 129 132 210 225 231 306 
3 17-3 19 336 340 359 380 398 446 448- 
463 512 519 545 554 587 598 698 724- 
725 789 812 836 868 873 927 947 952 
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Tic«ie Orisin 


RNA Source 


Hyseq Library Name 


SEQIDNOS: 

976 1007 1042 1083 1085 1097-1110 
1122 1147 1177 1226-1229 1234 1311 
1313 


thyroid gland 


Clontecli 


THROOl 


14 41 49 76 94 111 144 151 183 188 
210 217 222 253 264 271 277-286 294 
320-326 345-352 361 381-382446 467 
483 514 534 549-550 564 578 602 649 
844 882-883 927 950 956 976 1008- . 
1028 1076 1083 1117-11201142 1163- 
1175 1230-12381308 


trachea 


Clontech 


TRCOOl 


223-225 238 287 353-354 514 
545 592 611 873 883-884 927 
9521029-1031 1042 1151-1152 
11701176-1177 1239 


uterus 


Clontech 


UTROOl 


151 226 288-290 355 537 877 
885-886 976 1001 1032-1033 
1232 



TABLE 2 



SEQ 
ID 


Accession 
No. 


Species 


Description 


OIUIUI** 

Waterman 
Score 


/o 

Identity 


1 


B02829 


Homo sapiens 


Human G protein coupled receptor hRUPS 

protein SEQ ID NO:10. 

Human secreted protein, SEO ID NO: 7645. 


460 
111 


100 
51 


2 

a 
J 


G03564 
R26173 


Homo sapiens 
Homo salens 


Part of Major Yo paraneoplastic antigen 
(CDR62) encoded by clone pY2. 


293 


76 


4 


L29536 


Homo sapiens 


calcium channel L-type alpha 1 subunit 


191 


65 


5 


Y94943 


Homo sapiens 


Human secreted protein clone ytl4_l protein 
sequence SEO ID NO:92. 


251 


50 


6 


MU507 


Homo sapiens 


transferrin receptor 


120 
1941 


95 
93 


7 
8 


AF099100 
Y92338 


Homo sapiens 
Homo sapiens 


WD-repeat protein 6 

Human cancer associated antigen precursor from 
doneNY-REN-45. 


245 


82 


9 


G01343 


Homo sapiens 


Human secreted protein, SEO ID NO: 5424. 


226 


91 


10 


AJ133798 


Homo sapiens 


copine VII protein 


1127 


68 


11 


G02449 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6530. 


584 
282 


99 
78 


12 
13 


X98330 
AL024498 


Homo sapiens 
Homo sapiens 


ryanodine receptor 2 

d/417M14.2 (novel serine/threoninc-protdn 
kinase (ortholog of mouse and rat MAK (male 
germ cell-associated kinase)) 


293 


100 


14 


AFW5577 


Pan 

troglodytes 


ol&ctoiy receptor OR93Ch 


191 


36 


15 


G03131 


Homo sapiens 


Human secreted protein, SEO ID NO: 721Z 


93 


39 


16 


U26595 


Rattus 
norvegicus 


prostaglandin F2a receptor regulatory protein 
precursor 


569 


89 


17 


B08918 


Homo sapiens 


Human secreted protein sequence encoded by 
gcnc28SEQIDNO:75. 


99 


44 


18 


Y36203 


Homo sapiens 


Huihan secreted protein #75. 


165 


75 


19 


U15647 


Mus 

musculus 


reverse transcriptase 


106 


40 


20 


G02701 


Homo sapiens 


Human secreted protein, SEQ CD NO: 6782. 


544 


100 


21 


Y35923 


Homo SE^iens 


Extended human secreted protein sequence, SEQ 
ID NO. 172. 

Human secreted protein, SEO ID NO: 81 11. 


1691 
380 


100 
"96 


22 
23 


G04O30 
Q02455 


Homo sapiens 
Homo sapiens 


Human secreted protein, SEO ID NO: 6536. 


123 


50 


24 


AF036329 


Homo sapiens 


gonadotropin-releasing homionc precursor, 
second form 

"Human secreted protein, SEQ ID NO: 8148. 


284 
"96 


90 

32 


25 
26 
27 
28 


G04067 
S80119 
U83303 
G03267 


Homo sapiens 
Rattus sp. 
Homo sapiens 
Homo sapiens 


reverse transcriptase homolog 

Une-1 reverse transcriptase 

Human secreted protein, SEQ ID NO: 734K. 


100 
101 
135 


34 
35 
45 



104 



wo 01/57188 



PCTAJSOl/03800 



ID 

NO: 



29 



Accession 
No. 



G04067 



G02872 



Descripbon 



H^mo sapiens Human secreted p rotein. SEQ ID NO: 8148. 
Homo sapiens Human secreted pro teuL SEQ ID NO: 6953. 



WatennaD 
Score 



Human secreted F otrim SEQ ID NO: 7452. 



83 
116 



96 



Identity 

42 
72 



67 
32 



32 
33 
34 



G03224 
Y66688 



Human g^rrptf^d protein. SEQ ID NO: 7305. 



Homo sapiens 



Membrane-bound protein PROl 152. 



Y87071 



Homo sapiens 



Human secreted protein sequence SEQ ID 
NO: no. 



348 



m5i3i 

Y73464 



pl26 



182 
982 



98 
95 



48 
90 



35 

IT 



Homo ss^iens 



Human secreted protein clone yl4 J protein 
sequence SEQ ID NO:150. 



37 



38 



39 



40 



41 



42 



43 



44 



45 



46 



47 
48 



49 



AL133215 



Homo sapiens 



bA108L7.6 (scmaphorin W iscma domam, 
immunoL 

domain (TM) and short 



AC067969 



AL031588 



G03628 



AF 132969 



Y36268 



X61048 



M76546 



U82288 



G03477 



AF090942 



G03564 



AJ005560 



amino acids 
3338-4088 



Homo sapiens lyanodine receptor 1 (skeletal} 



386 



Homo sapiens 



Homo sapiens 



Homo sapiens 



Homo sapiens 
Hydra sp. 
Helianthus 
annuus 



Cacnoitiabditi 
s elegans 



Homo sapiens 



Homo sapiens 



Homo sapiens 



Mus 

musculus 



dJ 1 163J1.1 (mostly supported by GENSCAN. 
FGENES and GENEWISE) 



493 



HiiTnan secreted protcin. SEQ ID NO: 7709^ 



10 



CGl-35 protem 



228 



Human secreted protein encoded by gene 45. 



220 



mini-collagen 



hydroxyproline-rich protein 



105 
TIT 



Rac-iikeGTPase 



139 



Human secreted protein, SKQ ID NO: 7558. 



118 



PRO0657 



113 



Human secTOtcd ptotein. SEQ lU NOTTgjj^ 



SPB2B protein 



2L 
72 



"ST 



76 



51 
68 



88 
35 
31 



70 



58 



63 
59 



50 
51 



52 
53 



54 
55 



G02450 



Homo sapiens 



Y91649 



Homo sapiens 



U93563 



Y55927 



G02607 



AB008175 



M68941 
AL031600 



Human secreted protein, SfcQ ID NO: 653 1 . 
Human secreted protein sequence encoded by 
genc60SEQlDNO:322. 



973 



Homo sapiens 



Homo sapiens 



Homo sapiens 



Mus 

musculus 
Homo sapiens 



Homo siq)iens 



Homo sapiens 



Homo sapiens 
Homo sapiens 



Homo sapiens 



Homo sapiens 
Homo sapiens 



putative pi 50 



105 



Human STLK2 protein 



699 



Fluman secreted protein, SEQ ID NO: 6688. 



145 



hepatic nuclear fector 1-beta short fonn 
protein-tyrosine phophatase 



356 
16r 



C390E6.1 (chloride channel 7)" 
putative pheromone receptor 

zinc finger protem ZFPl 13 

interferon reguitoiy factor 7 

protein-tyrosine kinase 

Human secreted protem clone cb98_4. 

repressor transcriptional factor 

ubiquitin-conjugating BIR-domain en2yme 

APOLLON 

Human secreted protein, SbQ iD NO: 7964. 
antcnnal specific membrane protein AMP 



Human secreted protein, SEQ ID NO: 7046. 
Human oxidoreductase YTFQ3, 



94 



38 
85 



56 
74 



41 
76 



105 
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SEQ 

ID 

NO: 


AccessioQ 
No. 


Species 


Description 


^mith- 

Wateiman 

Score 


% 

Identity 


75 


Y00826 


musculus 
Rattus 


gp210(AA 1-1886) 


413 


84 


76 


AFl 17754 


Homo sapiens 


thyroid hormone receptor-associated pfotdn 
complex component TRAP240 


351 
468 


54 

nc 
to 


77 


Y38422 
Y14596 


Homo sapiens 
Homo sapiens 


Human secreted protein. 

Human T-type voltage-gated Ca channel alpha- 

l-l(hCavT3). 


1357 


99 


79 


¥14591 


Human 
papillomaviiu 

stype 68 


ArJVi'i proicul 


767 
71 


100 
34 


80 
81 


AL137802 
AF000383 


Homo sapiens 

Arabidopsis 

thaliana 


protem arginuicN-methyltransfcrasc-likc protein 


359 


65 


82 


U6815 


Mus 

musculus 


DNA binding protein Rc 

Human secreted protein, SEO ID NO: 5681. 


895 
315 


75 
96 


83 
84 

85 


G01600 
Y53886 

AB029002 


Homo sapiens 
Homo sapiens 

Homo sapiens 


A suppressor of cytokine signalling protein 
designated HSCOP-6. 
KIAAl 079 protem 


538 
134 


71 

42 


86 
87 


Y28678 
Y99368 


Homo sapiens 
Homo sapiens 


Human cv/272 7 secreted orotem. 
Human PR01326 (UNQ686) ammo acid 
sequence SEQ ID NOtlOO. 


325 
156 


62 
48 


88 

89 
90 


AJ225124 

AF177203 
Y28280 


Mus 

musculus 
Homo sapiens 
Homo sapiens 


hypeipolarization-activated cation channel, 
HAC3 

cerebral cell adhesion molecule 

Human G-protein coupled receptor GRlR-2. 

polycystic kidney disease-associated protein 


487 

290 
326 
1751 


05 

56 
79 
95 


91 
92 
93 
94 


L39891 
AF064876 
AF170723 
X13292 


Homo sapiens 
Homo sapiens 
Homo sapiens 
Trypanosoma 
brucei 


ion channel BCNG-1 
protein kinase SfKlO 
GPl-phospholipase u ^aa i • ojo) 


953 
401 
151 

"851 


99 
53 
37 

"99 


95 
96 

97 


Y34127 
X03638 

AF134213 


Homo sapiens 
Rattus 
norvegicus 
Homo sapiens 


Human potassium cnannci fw-i-rmovi i. 
sodium channel protein I (aa 1-2009) 

ubiquitin-specific protease 


1775 
1995 


92 
99 


98 
99 


G00838 
AF021935 


Homo sapiens 

Rattus 

norvegicus 


Human secreted protein, SbU ID NO: 4919. 
mytonic dystrophy kinasc-related Cdc42-binding 
kinase 


213 
675 

867 


38 
48 

98 


100 
101 


AF279265 
AC007878 


Homo sapiens 
Homo sapiens 


putative anion transporter I 

match to nuclear protein, NP220; note: sequence 

difference at residue 58 


160 


60 


102 


U22829 


Mus 

musculus 


P2Ypurinoceptor 


264 


42 


103 


Y45Q23 


Homo sapiens 


Human sensoiy transduction G-protein coupled 
receptor-B3. 

Human secreted protein vb21 1. SEQ ID NO:20. 


516 
787 


99 
98 


104 
105 

106 


Y94990 
Y87342 

AF169312 


Homo sapiens 
Homo sapiens 

Homo sapiens 


Human signal peptide containing protein HSPP- 

n9SEQIDNO:ll9. 

hepatic ongiopoietin-related protein 


343 
212 


57 
CI 

Of 


107 
108 


AFn6657 
AE000401 


Homo sapiens 

Escherichia 

coli 


PRO1310 

sialic acid transporter 

Human seweted protein encoded by gene No. lu. 


74 
587 

693 


52 
96 

100 


109 
110 


Y38395 
Y78801 


Homo sapiens 
Homo sapiens 


Hydrophobic domain containing protein clone 
HP0063 1 amino acid sequence, 
nuclear pore complex protein hnupl53 


182 
464 


85 


111 
112 

113 


Z25535 
Y94939 

AF016365 


Homo sapiens 
Homo sapiens 

Homo sapiens 


' Human seaeted protein clone ye90_l protein 
sequence SEO ID NO:84. 
hcxokinaselisoformtd 


274 

301 
520 


51 

71 
75 


114 
115 
116 

117 


AC007956 

M83738 

AL157952 

W18084 


Homo sapiens 
_Homo sapiens 
Homo sapiens 

Homo sapiens 


unknown 

protein-tyrosine phosphatase 

dJ875K15.1.1 (ets homologous factor (ets- 
domam transcription fiictor ESE-3A, isofonn 1)) 
Human Aurora.2. 


251 
484 

546 


92 
91 

87 



106 
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1 SEQ i 
ID 1 
NO: 


Accession 

Ho. 


Species 

gomo sapienF 

^ttas 

norvegicus 


Description 

camkinasel , 

Mttnenliatifiul inositol 3-KmaSe 

pnoronouuyiumD«*v» •* 


Smith- r 
Waterman 
Score 
407 


% 

Identity 


1 118 


L41S16 
AJ006710 


627 


93 


120 


AF026954 


Bostaurus 


n^/niuQtA Hi^hvdmaenase Dhosohat&se i^ulBtory 
subunit precursor. PDPr 


1646 


94 


121 
122 


S39392 
U60805 


Homo sapiens 
Homo sapiens 


protein tyrosine phosphatase, PTPase {EC 

1 3.1.3.48} 

1 oncostatin-M specific receptor beta subunit 


373 
262 


68 

QQ 

( >0 


123 
124 


Y44403 
U88167 


Homo sapiens 
Caenorfiabditi 
selegans 


r Human truncated tankyrase-l. 

rcontains similarity to C2 domains 


111 
219 


35 
29 


125 


AF300648 


Homo sapiens 


4 


693 


90 

15 


126 


AB021861 


Mus 

musculus 


apoptosis signal-regulating kinase 2 


153 




127 
128 


AF305210 
M90360 


Homo sapiens 
Homo sapiens 


1 concentrative Na+"nucico5iuc couuui'uiu'i 

hCNT3 
1 protein kinase 

alpha IC adrenergic receptor isoform 2 


807 

220 
574 


73 
86 


1 129 
1 130 
131 

132 


AK208043 
AF201734 

AF112886 


Homo sapiens^ 
Mus 

musculus 
Bos taurus 


IFIl6b 

testis specific serine kinase-3 
differentiation enhancing factor 1 


496 
800 

159 


67 

87 

74 


1 133 
134 

1 135 


AJ278314 
W74802 

AB020335 


Homo sapiens 
Homo sapiens 

Homo sapiens 


1 phospholipase C-beta-lb — — 

Human secreted protem encoded by gene 73 
clone HSQEL25. , 

1 Pancreas-specific gene 


554 
1157 

"SSS 


85 
87 

"96 


136 
137 


W80408 
AC002563 


Homo sapiens 
Homo sapiens 


1 A secreted protein encoded by clone dt674_2. 
putative RHO/RAC eflector protem; 95% 
similarity to P49203 (PID:gl 345860) 

1 PR03434, a novel secreted protein. 


866 
5041 

891 


98 
99 

100 


138 
139 

1 140 


Y96736 
AB024034 

W97809 


Homo sapiens 
Arabidopsis 
thaliana 
Homo sapiens 


DNA-damage inducible protein DDIl-iike 
1 Human GTPase regulator GRAF. 


147 
248 


55 
56 


1 141 
142 


Y51557 
AF090U3 


Homo sapiens Human PLA2 protem. 

Rattus AMPA receptor bmding protein 

norvisgicus 


125 
623 

641 


46 
93 

82 


143 
1 144 


W26642 


Homo sapiens Human RECK cancer-mlubitrng proiein. 

Itattus transmembrane receptor 

norvegicus | , ; — — — 


578 


84 


145 
146 


AF264014 
W63683 


Tiomo sapiens scavenger receptor cyslemc-nch type 1 protein 
Ml 60 precursor 

Homo sapiens Human secreted protein 3. — 


727 
140 


92 

40 • 


147 
148 


M96264 
D64014 


"Homo sapiens- | ealactose-l-phosphate undyi transferase 

Escherichia HrsA 

coll ^ : — ^ 


513 
818 


81 
90 


149 


M83316 


Escherichia 
coli 


pppGpp phosphohydrolase 


915 


95 


150 


AL163279 


■ "iHomo sapiens homolog to cAMP response element bmdmg ana 
1 beta transducin family proteins 


1261 
940 


77 

99 


1 131 
152 

153 
154 


AF179867 
R95332 

AF151859 
X66957 


Homo sapiens STE20-like kmase 
■ Homo sapiens Tumor necrosis tactor receptor l oeam aomam 

ligand (clone 3TW). 

Homo sapiens CQl-lOl protem 

Homo sapiens hexokmase type 1 . . 


392 
370 


61 

92 
81 


1 155 
156 
157 


Y16355 
G00857 
AF159455 


Homo sapiens | alternatively spliced form _ 

- Homo sapiens Human secreted protem, «tmu Mu: ^i^^iH. 

Mus zinc finger protein 

musculus . — — 


432 
349 
352 

537 


92 
78 
74 

76 


1 158 
159 


L76191 
AP001743 


" - Homo sapiens interleukin-1 receptor-associated kmase 
" " Homo sapiens putative gene, ankirin like, possible dual 

mecifitv Scr/Thr/Tyr kmase domain 


670 


98 


160 
1 161 


AJ250425 
1 G02885 


Rattus CollybistinI 

norvegicus | 

""Homo sapiens | Human secreted protem, SEQ ID NO: 6966. 


556 
1 370 


74 

1 100 



107 
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SEQ 

ID 1 

NO: 

162 


Accession 5 
Mo. 

Z22968 ] 


ionio sapiens 


Descnption 
S4130 antigen 

ATP-dependent Ca2+ pump rMKi 


Simth^ r 

Waterman 

Score 

610 

336 


% 

Identity 

100 
92 


163 
164 
165 


AF181121 
AF055636 
AF160798 


Homo sapiens 

Rattus 
norvegicus 


leuctne-^dh glioma-inactivated protein precursor 
calcium transporter Cal 1 


455 
700 


94 
96 


166 


Y76332 


Homo salens 


Fragment of human secreted protein encoded by 
gene 38. 

Unman VtrpjK^^ tumour«associ&ted protein 68> 


51/ 

mi 


99 


167 
16S 


Y48607 
AB020741 


Homo sapiens 
Mus 

musculus 


NDC-related kinase 


197 
596 


43 
44 


169 
170 


AF252293 
U59429 


Homo sapiens 
Crioetinae 
gen. sp. 


PAR3 

diacylglyoerol kinase eta 
phosphatidylserine-specinc uhospholipasc Al 


481 

386 


82 
42 


171 
17Z 


AF035268 
/\r I/./ woj 


Homo sapiens 
Mus 

musculus 


semaphorin cytoplasmic domain-associated 
protein 3B ' 


507 


"99 


173^ 


Y27918- 


Homo sapiens 


Human secreted protein encoded by gene No. 
123. 

Human secreted protein, SEQ ID NO; 7060. 


653 
538 


97 


174 
175 


G02979 
U36488 


Homo sapiens 
Mus 

musculus 


embryonic stem cell phosphatase 


"168 ' 


55 . 


176 


W95629 


Homo sapiens 


Homo sapiens secreted protein gene clone 

Kml96 4. 

fomiiminotransferase cyclodeaminase torm U 


1022 

255 


100 

93 


177 
17o 


AF289023 


Homo sapiens 
Homo sapiens 


T-cell receptor alpha-chain (413 is 2nd base in 
codon) \ . 


710 


99 


179 '-■ 
181 


AF127481 

000978 
Y66645 


Homo sapiens 

Homo sapiens 
Homo sapiens J 


non-ocogenic Rho GTPase-specific GTP 
exchange factor 

Human secreted protein, JSiig ID NO: 5059. 
Mcrabranc-bound protein PRO1310. 


175 

517 
671 
862 


80 

94 
96 
100 


182 
IW 
184 

lOJ 

186 


AF110640 
AF169691 

AT iZuj 

L20966 


Homo sapiens 
Bos taurus 
Homo sapiens 
Homo saoiens 

Homo sapiens 


orphan seven-transmembrane receptor 

orphan transporter short splicing variant 

cadherin-like protein VR8 , 

thyiotr(^in-relcasing hormone degrading 

ectoenzyme 

phosphodiesterase 


766 
375 
985 

541 


84 
38 
99 

76 


187 
188 

189 


G02920 
Y94918 

Y66713 


_Homo sapiens 
Homo sapiens 

Homo sapiens 


Human secreted protem, acw uj ihkj, # wi. 
Human secreted protein clone dd504_18 protein 
sequence SEOIDNO:42. 
Membrane-bound protein PRO1309. 


254 
301 

694 


93 
98 

100 


ion 
191 


G03244 
U36771 


Homo sapiens 

Rattus 

norvegicus 


Human secreted protein, SiiO ID NO: 7325. 
sn-glycerol 3-phosphate acyltransfcrase 


331 
707 


73 
92 


192 


R05935 


Homo sapiens 


■ "Secreted GPUb subunit of multiple §ubunit 
polypeptide (MSP)GPIlb-ina. 


157 


72 


193 


M92084 


Theileria 
parva 


casein kinase U alpha subunit 
Membrane-bound protein PRO1310. 


364 
448 


50 

90 


194 
195 


W95631 


Hnmo saoiens 
Homo sapiens 


Homo sapiens secreted protein gene clone 
hi968, 2. 


382 


49 
■"99 


196 


AR55614 


Rattus 
norvegicus 


scaffolding protein SUPR 


680 




397 


AC021640 


Arabidopsis 
thaliana 


" putative phosphatidatepnosphohydrolase 


300 


41 


198 
199 


AF073967 

"W01730 
AFl 17948 


Mus 

musculus 
domesticus 
Homo sapiens 
Homo sapiens 


' "Human G-orotein receptor Hi'RAJ70. 

pancreas-enriched phospholipase C . 


316 

617 
625 


43 

98 
89 


200 
201 
202 
203 

204 
205 


' AF128625 
' AFl 17946 
-■V53021 

AF227968 
S81752 


Homo sapiens 
Homo sapiens 
Homo sapiens 

Homo sapiens 
Homo sapiens 


CPC42-binding protein kinase beta 

" T tnir piiflnme nucleotide exchange factor II 
" Human secreted protein clone qc640 protein 

se(^uenceSE01DNO:48. 

SH2-B beta signaling protein 
" nPH7T ^Andidate tumor suppressor gene 


" 636 
1303 
701 

182 
375 


" 94 
100 
" 99 

79 
100 
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SEQ 

ID 

NO: 

206 


Accession 
No. 

U18315 


Species 

Sus scrofe 


Description 

{ovarian cancer critical region of deletion} 
parathyroid receptor 

putative pheromone receptor VlRLl long form 


Smith- 

Watemian 

Score 

122 
170 


% 

Identity 

"60 
96 


207 
208 
209 
210 


AF255342 
S52051 
W63683 
D79992 


Homo sapiens 

lattus sp. 

iomo sapiens 
Homo sapiens 


neurotransmitter transporter 

Human secreted protein 3. 

similar to Drosophila photoreceptor cell-specific 

protem, calphotin. 


715 
840 
541 


94 
99 
82 


211 


AFl 17948 


iomo sapiens 


pancreas-enriched phospholipase C 


1348 


99 


212 


U81035 


Rattus 
Dorvegicus 


anlcyrin binding cell adbesion molecule 
neuroiascin 


471 
798 


69 
56 


213 
214 


AFl 54846 
AF102777 


Homo sapiens 
Mus 

musculus 


zinc finger protein 

FYVE fingcr-contaming phosphomositide kinase 

putative gene containing transmembrane domain 


933 

523 


93 
89 


215 

Hit 

217 


AL163303 
G04095 


Homo sapiens 

norvcgicus 
Homo sapiens 


prostaglandin F2a receptor regulatory protein 
precursor 

Human secreted protein, SEQ ID NO: 8176. 


563 

"644 

11/1 


78 
98 

Oi 


218 
219 
220 


X75756 
¥66723 
Doo577 


Homo sapiens 
Homo sapiens 
Mus 

musculus 


protein kinase C mu 
Membrane-bound protein PROUOO. 
KunfFcr cell receotor 

0TRPC4 


770 
567 

853 


98 
40 

100 


221 
222 


AF258465 
AF021935 


Homo sapiens 

Rattus 

norvegicus 


mytonic dystrophy tanasc-relatec uic4Z-Dmamg 
kinase 


636 


VO 


223 


AL136527 


Homo sapiens 


bA215B13.1 (A kinase (FRKA) anchor protein 

in 

WNT receptor Fri2zled-4 


"693 
"690 


100 
*99 


224 
225 


AB032417 
AF03G43O 


Homo sapiens 
Mus 

musculus 


semaphorin Via 


703 




226 


AE000218 


Escherichia 
coli 


putative dihydroxyacetone kinase (EC 2.7. 1.2) 
phosphoinositol 3-phosphate-binding protein-2 


297 
2080 


39 
100 


227 
228 


AF302150 
AB024573 


Homo sapiens 
Mus 

musculus 


OTP-binding like protein 2 


265 




229 

230 


AF122924 

G03205 


Xenopus 
laevis 

Homo sapiens 


Wnt inbibitoiy &ctor-l 

Human secreted protein, SisU ID NO: 7af6. 


229 


40 

100 
92 


231 
232 
233 

234 


X98260 
R92754 
R75111 

W69431 


Homo sapiens 
Homo sapiens 
Homo sapiens 

Homo sapiens 


M-phase phosphoprotcin 1 1 

Human growth differentiation factor-12. 

Glycosyl-phosphatidylinositol-specific 

phospholipase-D. 

Human secreted protein cwlZJ.J_J. 


682 
859 


95 
100 

97 
81 


235 
237 


Y08686 
X81466 


Homo sapiens 
Homo sapiens 
Mus 

musculus 


serine palmitoyltransferase, subunitll 
atrophiiMlated protein ARP 
Embiyo Brain Kinase 


117 
460 


37 
62 


238 


U64857 


Caenoriiabditi 
s elegans 


similar to the BPTI/Kunitz family of inhibitors, 
most similar to tissue factor pathway inhibitor 
precursor (TFPI) 




33 


239 


AJ250840 


Mus 

musculus 


serine/threonine protein kinase 


739 


63 


240 


AJ223472 


Mus 

musculus 


transcription elongation factor TFlIS.h 


222 


38 


241 


Y94906 


Homo sapiens 


Human secreted protein clone rb649_3 protein 
sequence SEQ ID N0:18. 
Na+/sulfate cotransporter SUT-1 


353 
591 


52 
"99 


242 
243 


AF169301 
L22022 


Homo sapiens 

Rattus 

norvegicus 


orphan tiansporter v7-3 


667 


93 


244 


AF016191 


Rattus 
norvegicus 


potassium channel 


'1043 


■95 


245 
246 
247 
248 
249 


AF097366 

Y29868 

AF180475 

Y17227 

AF250910 


Homo sapiens 
Homo sapiens 
Homo sapiens 
Homo sapiens 
Manduca 


cone sodium-calcium potassium exchanger 
Human secreted protein clone pp325 .9. 
Not4-Np 

Human secreted protein (done yal-1). 
death-associated small cytoplasmic leucme-rich 


645 
497 
188 

690 
182 


98 
98 
83 
99 
31 
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ID 
NO: 


Accession 
No, 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identity 






sexta 


protein SCLP 






250 


AF192756 


Kaposi's 
sarcoma- 
associated 
herpesvinis 


Oif73 


134 


34 


251 


AB022694 


Homo sapiens 


MOK protein kinase 


209 


83 


252 


W55045 


Homo sapiens 


Neural adhesion molecule (cthb0018f2 product). 


469 


100 




L46815 


Mus 

musculus 


DNA binding protein Rc 


231 


67 


254 


W68505 


Homo sapiens 


Human acid sensing ionic channel 


173 


82 




AF070066 


Mus 

musculus 


Citron-K kinase 


1201 


98 




G02491 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6572. 


460 


100 


257 


Zl284i 


Oiyctolagus 

riintPiiliiQ 


Phospholipase 


368 


80 


258 


Y95436 


Homo sapiens 


Human calcium channel SOC-3/CRAC-2. 


1857 


"99 


259 




Mus 

mi ic^lill IC 


1^ Ubl 1 iWll 1 


430 


72 


260 


AJ250839 


Homo sapiens 


serine/threonine protein kinase 


861 


100 


261 


AJZ49977 


Homo sapiens 


AMT>.»rtivflted nrotein kinase canuna 3 subunit 


758 


98 


262 


AF14138D 


Rattus 
norvegicus 




198 


40 


263 


AF02285y 


Homo salens 




335 


62 


264 


AF160477 


Homo sapiens 


Ig supcrfamily receptor LNIR precursor 


387 


91 


263 


Y44662 


Homo sapiens 


(GPCR). 


636 


99 


266 


U27269 


Mus 

musculus 


SOQium glucose wjutuioj/ui LCI 


204 


56 


267 


AF124491 


Homo sapiens 


/Vtvr VJIa BSiC'abUVCUpUlg |/ivivui >jha< 


139 


75 


268 


AF127389 


Rattus 
norvegicus 


putative taste receptor TRl 


209 


39 


269 


X98296 


Homo sapiens- 


ubicjuitin hydrolase 


215 


95 


270 


X78482 


Strcptococcus 
pyogenes 


Fc*ganuna receptor 


129 


26 


271 


AB009883 


Nicotiana 
tabacum 




109 


26 


272 


AFl373o7 


Mus 

musculus 


VP^Ifl rinmiiin recentor Drotcin SORCS 

y^^jiy UmUlim iwWf'lwA |/iwfcw*** wXi'*-*'^— 


"199 


97 


273 


L34938 


Rattus 


innntrnnir crliltflmate reCCDtOr 


460 


86 


274 




nQmu SapiCllS 


dJ413H6.1.1 (hamster Androgen-dependent 
Expressed Protein LIKE PUTATIVE protein) 
(isofonn 1) 


TiS 


74 






Homo sapiens 


ubiquitin-conjugating BIR-domain enzyme 
APOLLON 


173 


"94 


A /V 


G02872 


Homo saoiens 


Human secreted protein, SEQ ID NO: 6953. 


148 


56 


277 


L40380 


Homo sapiens 


thyroid receptor interactor 


430 


61 


^ /o 




Homo sapiens 


KIAA1631 protein 


283 


96 


270 




Arabidopsis 
thaliana 


Contains PI^00069 Eukaiyotic protein kinase 
domain. 


157 


43 


280 


M83738 


Homo sapiens 


protein-tyrosinc phosphatase 


181 


73 


281 


AK024397 


Homo sapiens 


unnamed protein product 


439 


91 


282 


AF141326 


Homo ss4)iens 


RNA helicase HDB/DlCEl 


497 


84 


283 


AF156530 


Mus 

musculus 


ETS-domain transcriptional repressor PEl 


605 


76 


284 


Y29336 


Homo sapiens 


Human secreted protein clone cs756_2 alternate 
reading frame protem. 


647 


100 


285 


Y73402 


Homo sapiens 


Human secreted protein done yc25_l protein 
sequence SEQ ID NO:26. 


300 


90 


286 


AF016411 


Homo sapiens 


KCNA3.1B 


137 


100 


287 


W89253 


Homo sapiens 


Human ALP. 


688 


97 


288 


AFl 12886 


Bos taurus 


differentiation enhancing factor 1 


750 


96 


289 


AFn3131 


Homo sapiens 


host cell factor horaolog LOP 


367 


44 


290 


U52111 


Homo sapiens 


plexin-related protein 




100 


291 


AF026S04 


Rattus 


SPA-1 like protein pl294 


603 


89 
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SEQ 

ID 

NO: 


Accession 
No, 


Species 


DescnptioD 


Smith* 
Watcnnan 

Score 


% 

Identic 


292 


AF102854 


Rattus 
norvegicus 


membrane-associated guanylate kinase- 
interacting protein 2 Maguin-2 


124 


53 


293 


X99211 


Drosophila 
melanogaster 


ubiquitin-specific protease 


143 


38 


294 
295 


Y94943 

Y94890 


Honx) sapiens 
Homo sapiens 


Human secreted protem clone yw*„i prwcm 

scc|uenceSEQIDNO:92. 

Human protein clone HP02798. 


185 

108 
154 


94 

59 
96 


296 
297 
298 


AF019767 

Y28568 

Y94943 


Homo sapiens 
Homo sapiens 
; lomo sapiens 


zinc finger protein 

Secreted peptide clone bd577_l. 

Human secreted protein clone ytl4_l protein 

sequence SEQ ID NO:92. 


568 
182 


84 

97 

"69 


299 

300 
301 


B08906 

R58890 
AF022859 


Homo sapiens 

Homo sapiens 
Homo sapiens 


Human secreted protein sequence encoded by 
oRtif. 1 a SEO ID NO:63. 
Human-32 cadherin-related molecule. 
neuropilin-2(a0) 

Unman mifntrenic regulator (lU0X2. 


60S 

212 
277 
716 


97 

100 

97 


302 
'303 
304 
305 


Y71124 
Y44297 
D32050 
U43586 


Homo sapiens 
Homo sapiens 
Homo sapiens 
Homo sapiens 


Human receptor tyrosine kinase. 
alanyl-tRNA synthetase 

* trinac^ iviflfftd tfs Rfif nrotBin kinsscsz 
protem Kinase reiiuGu w i\«u }/ivu»ui 

Method: conceptual translation supplied by 
author 

Human H13 viral receptor mutant 4. 


228 
192 
428 

280 


97 
80 
72 

9j 


306 
307 


R54872 
D78572 


Homo sapiens 
Mus 

musculus 


membrane glycoprotein 


199 


41 


308 
309 


AF255614 
S79463 


Rattus 
norvegicus 
Mus sp. 


scaffolding protein SLIPR 

semaphorin homolog^-Sema F 


639 

162 


"88 

J 

89 


310 
311 


AF178941 
U03413 


Homo sapiens 
Dictyostelium 
discoideum 


A T"!} wiwtAintT r'ficc^ftft citb-fiuniiv A fliemo6r 2 
calcium binding protein 


736 
151 


100 
36 


312 


Y87347 


Homo sapiens 


Human signal peptide containing protein HSPP- 

124SEQIDNO:124. 

dJ388M5.4 (putative GS2 like protein) 


744 

789 . 


lOU 

99 


313 
314 


Z97055 
AC004010 


Homo sapiens 
Homo sapiens 


~V>»Tiii<ir in 1 ^itcine.rich transmembrane proteinsi 
44% similarity to U42767 (PID:gl736918) 


197 


38 


315 


AL021392 


Homo sapiens 


dJ439F8.2 (supported hy GENSCAN and 

rjFhJFWISE^ 


278 


38 


316 


U7a209 


Mus 

musculus 


polycystic kidney disease 1 protem 


165 


38 


7317 


AF109643 


Rattus 
norvegicus 


coxsackifr-adenoviros-receptpr bomolog 


Z23 
138 


JO 


319 
320 


API 04923 
AF100287 

CjUUSoo 


Homo sapiens 
Tiypanosoma 
vivax 


putative transcription factor 

activated protein kinase C receptor homolog 

Human secreted protein, SEO ID NO: 4669. 


141 

125 


38 
51 


321 
322 


Y21591 
D26070 


Homo sapiens 
Homo sapiens 


Human secreted protein (clone CC332-33). 
human type 1 inositol 1,4,5-trisphosphate 

receptor 


459 
232 


97 
97 


323 


Y27918 


Homo sapiens 


Human secreted protein encoded liy gene No. 
123. 

neuronal thread protein AD7C-NTF 


306 
209 


88 
70 


324 
325 


AF010144 
M19650 


Homo sapiens 
Homo sapiens 


■ ■ 2'.3*-cyclic-nucleotide 3'-phosphodiestBrase {tJfs 

3.1.4.37) 

A secreted protem encooea oy clone dpo«k>_i v. 
protein kinase C mu 

" "Human secreted protein, SbX> IDNO: 6373. 


214 
' 140 


97 

"70 
" 78 
" 99 


126" 
327 
328 


" W80396 
" X75756 
G02292 


" [Homo sapiens^ 
' ^omo sapienT 
Homo sapiens 


""540 
721 


329 
330 


AFl 68990 
S67984 


Homo sapiens 
Homo sapiens 


putative GTP-binding protein 
' anti-HIV gpl20 antibody heavy chain varianie 
region 

LDL-rcceptor related precursor ^AA -19 to 45Z5) 


877 
581 

2823 


" 99 
80 

98 


331 
332 

333 
334 


X13916 
Y87330 

Y28503 
AC002563 


Homo sapiens 
Homo ssqiiens 

Homo sapiens 
Homo sapiens 


Human signal peptide containing protein HSPP- 
107SEQIDNO:107. 

HGFH3 Human Growth Factor Homologue 3. 
■ ■ putative RHO/RAC effector pioteim 95% 


1127 

320 
327 


100 

98 
93 



111 
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SEQ 1 
ID 1 
NO: 


Accession 
Ho. 


Species 


Description 


Watennan 
Score 


% 

Identity 


335 


y87347 


Homo sapiens 


similarity to P49205 (l'lD;gl34586U) 

Human signal peptide containing protein HSPP- 

124SEQIDNO:124. 


nil 


67 


336 


AF006466 


Mus 

mi ic nil 111 6 


lymphocyte specific formin teiaied protdn 


193 


75 


337 
338 


AF265555 
¥13443 


Homo sapiens 
Homo sapiens 


ubiquitin-conjugating BlR-domain en^e 
APOT,I/>N 

Amino acid sequence of hSlo3-2. 
putative GAB A-gated ^loride channel 


632 

516 
189 


97 

100 
100 


339 

340 1 
341 


Y07637 
Y05734 
AE000497 


Homo sapiens 
Bomo sapiens 
Escherichia 
coli 


Human Grt)7 effector 2J2AVI protem. 
Li'iuonsuc uiiiioCiipiiuiitti ik«guiiuvi 


2156 
928 


99 
98 


342 


D90855 


Escheri^a 
coli 


giyceroi-j-pnospnaie acnyoruftcuna© 
1.1.99.5) chain A, anaerobic 


769 


99 


343 


D85613 


Escherichia 
coli 


membrane component 


"399 


100 


344 


M93239 


Escherichia 
coli 


transmembrane protein 


232 


100 


345 


M60177 


Escherichia 
coli 


enterobactin 


759 


99 


346 


D90699 


Escherichia 
coli 


Soisor protein cops ip^^ ^- f-^-)- 


638 


97 


347 


D90843 


Escherichia 
coli 


CapB protein. 


552 


100 
"96 


348 


M13422 


Escherichia 
coH 


49 kd protein 


1193 




349 { 


L10328 


Escherichia 
coli 


similar to drug resistance translocases 


340 


90 


350 


X69942 


Mus 

musculus 


enhancer-trap-locus-l 


560 


82 


351 


AF239613 


Homo sapiens 


apamin-sensilivc small-conductance c:a2+- 
activated potassium channel 


463 


80 


352 


1596777 


Escherichia 
coli 


3-hydioxybutyTyl-CoA dehydrogenase Cli^ 
) . 1 . 1 .15^ (b- hydroxybutyiyl-CoA 
dehydrogenase) (BhbD). 


577 


100 


353 . 


D90863 


Escherichia 
coli 


similar to 

Human transmembrane protein HP02000. 


311 

133 


98 

58 


354 
355 

356 


Y52386 
Y31645 

Y58637 


Homo sapiens 
Homo sapiens 

Homo sapiens 


Human transport-associated protcin-7 CTRANP- 
7). 

Prnfein re^ulatinc eene expression PROE-30. 


482 
119 


55 
51 


357 
358 

359 


1 AF119226 
Y87219 

J00132 


Homo sapiens 
Homo sapiens 

Homo sapiens 


dual-specificity tyrosine phosphatase YVHl 
Human secreted protein sequence SEQ ID 
NO:258. 
beta-fibrinogen 
" "Human secreted protein, SEQ ID NO: 7870. 


■1788 
165 

233 
128 


100 
100 

93 
70 


360 
361 
362 

363 
364 
365 


G0378y 
R28916 
U16655 

GOBI 19 
U47276 
G03789 


riomo sopiciid 
Homo sapiens 
Rattus 
norvegicus 
Homo sapiens 
Gallus gallus 
Homo sapiens 


Type ni procollagen (prior art), 
phospholipase C delta-4 

' Human secreted protem, SEQ ID NO: 72U0. 
chicken brain factor-2 

Human secreted protein, SEQ ID NO: 7870. 


108 
649 

104 
183 


40 
65 

42 
34 
65 


366 
367 
368 
369 


1 G04091 
X98258 
AL021366 
U70932 


Homo sapiens 
Homo sapiens 
Homo sapiens 
Peromyscus 
leucopus 


Human secreted protein, SEQ ID NO: 8172. 

M-phase phosphoproteia9 

CICK0721 0.3 (Kinesin related protein) 
reverse transcriptase 


118 
564 
3387 
92 


46 
75 

5.9 


370 
371 


X86400 

1 G03172 


Homo sapiens 

Homo sapiens 


gamma subunit of sodium potassium ATPase 
like 

' Human secreted protein, SEQ ID NO: 7253. 


242 

165 
257 


73 

56 
55 


372 
373 
374 

375 


U49974 
1 X13916 
AF234765 

1 U49974 


Homo sapiens 
Homo sapiens 
Rattus 
norvegicus 
Homo sapiens 


mariner transposase 

LDL-rcceptor related precursor (AA -iV to 4DZ3) 
" serine-arginine-rich splicing regulatory protein 
SRRP86 

mariner transposase 


21193 
1182 

172 


99 
78 

55 
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382 



U64830 



Dictyostelium 
discoideum 



protein tyrosine kinase 



383 



384 



G02916 



Q01194 



Homo sapiens 



Human secreted protein, SEQ ID NO: 6997. 



Homo sapiens 



Hnman secreted protein, SEQ ID NO: 5275. 



618 
617 



AJ245822 



Homo sapiens 



type I transmembrane receptor 



4560 



386 



387 



388 
389 



D86974 



Homo sapiens 



KIAA0220 



G03203 



Homosapiens 



Homo sapiens 



Human secreted protein, SbCj iUNOr7284. 



142 



Human secreted protein. SEQ ID NO: 8153. 



99 



390 



391 



392 



393 



394 



395 



M12140 



AJ293309 



Y42751 



W48351 



Y14442 



W85607 



Y76332 



G03930 



Homo sapiens 



envelope protein 



Homo sapiens 



>fHP2 protein 



461 



Homo sapiens 



Human calcium binding protein 2 (CaBP-2). 



181 



Homo sapiens 



Human breast cancer related protein BC3tB2^ 



241 



Homo sapiens 



Homo sapiens 



olfactory receptor protem 

Secreted protein clone da228_6. 



957 



Homo sapiens 



Fragment of human secreted protein encoded by 



171 



gene 



38. 



Homo sapiens 



Human secreted protem, Slig iDNO: 8011. 



250 
105 



397 



AB032904 



Hylobates 
syndactyhis 



dopamine receptor D4 



398 



399 



400 



401 



403 
404 



AJ007798 



Y91405 



Y29861 



D87002 



AF100754 



X7490r 



Homo sapiens 



stromal antigen 3, (STAG3) 



Homo sapiens 



Human secreted protein sequence encoded by 
gene 2 SEQ ID NO: 126. 



Homo sapiens 



Human secreted protein clone cb98_4^ 



1047 
T62" 



Homo sapiens 



similar to rat integral membrane glycoprotein; 
accession number Z21 5 13. 



527 



Homo sapiens 



ancient ubiquitous protein AUPl isofoim 



853 



Gallus gallui 



alpha-2-macroglobulin receptor 



258 



405 



406 



407 



408 



409 



410 



AF075462 



Mus 
musculus 



X92887 



Human 
endogenous 
retrovirus K 



ADP-ribosylation factor-directed GTPase 

activating protein isofonn b 

pol/env 



162 



Y30162 



Homo sapiens 



Human dorsal root receptor 4 hDRR4. 



325 



AK022626 



Homo sapiens 



unnamed protein product 



L13802 



Homo sapiens 



Y91600 



Homo sapiens 



ribosmal protein small subunit 

Human secreted protein sequence encoded by 
gene9SEQI DN0373. 



2833 

TtST 



W88745 



Homo sapiens 



Secreted protein encoded by gene 30 clone 

HTSEV09. 

Chat-H 



2004 



411 



412 



AB043953 



Mus 

musculus 



Y86233 



Homo sapiens 



Human secreted protein HNTftmg. SEQ ID 
NO:148. 



ToTT" 




Arabidopsis 
thaliana 
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SEQ 
ID 

NO: 


Accession 
No. 


Species 


Description 


Smith- 
Waterman 

Score 


% 

Identity 


423 


AF231705 


Homo sapiens 


AJu cc-repressor 1 


1090 


100 


424 


An34887 


Homo sapiens 


FLAMINGO 1 


6268 


97 


425 


Y35942 


Homo sapiens 


Extended human secreted protem sequence, SEQ 
ID NO. 191. 


1961 


99 


426 


AB009288 


Homo sapiens 


N-copine 


635 


98 


427 


L12392 


Homo sapiens 


Huntington's Disease protein 


16080 


99 


428 


Y94990 


Homo sapiens 


Human secreted protein y\a\J, SEQ ID NO:20. 


768 


98 


429 


AJ293573 


Homo sapiens 


zinc finger protein Cezanne 


542 


87 


410 


Y84441 


Homo sapiens 


Amino acid sequence of a human RNA- 
associated protein. 


2074 


100 


431 


G02850 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6931. 


723 


95 




G04067 


Homo sapiens 


Human secreted protein, SEQ JD NO: 8148. 


73 


42 




AF 159296 


Lycopersicon 
esculentum 


extensin-Uke protein 


613 


48 


Aid 


W48351 


Homo sapiens 


Human breast cancer related protein BCRB2. 


135 


44 




X73874 


Homo sapiens 


phosphotylase kinase 


3442 


97 




AF 161 426 


Homo sapiens 


HSPC308 


268 


74 


All 




14nmo (^aniens 


Human secreted protein encoded from gene 2. 


1055 


52 


438 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7879. 


168 


56 


43 y 


A14/00 




GABA-A receptor alpha 1 subunit 


2294 


96 


44U 






beta-tubuiin 


311 


95 


441 




\Snm/\ cum An C 


activating signal cointegrator 1 


1882 


• 100 


442 


111 CTi 




7inc iinser orotein 


795 


54 


443 






Human secreted protein, SEQ ID NO: 7284. 


93 


26 


444 




UUlUCIlkliiwU 


HUMAN NDR 


2451 


100 


445 


X98330 


Homo sapiens 


ryanodine receptor 2 


9356 


99 


446 


AFI loTIZ 


nomo Sapiens 


PR02738 


227 


49 


447 


AF24S447 


Homo sapiens 


sphingosine kinase type 2 isoform 


576 


99 


448 


AFI 3 3086 


Homo sapiens 




2630 


94 


449 


U87305 


Rattus 
norvcgicus 


transmembrane receptor TJNC5H1 


817 


93 


450 


Ar0olZ4y 




JAW 1 -related protein MR VII A iong isoform 


"4568 


"99 


451 


ALUUD4VO 


Urvm/v eanii>nc 
MOUlv SapiClid 


R31665 1 


316 


62 


452 


M60235 


Homo sapiens 


granule membrane protein- 140 


464 


73 


453 


AB0367u6 


Homo sapiens 


Intel ecti n 


730 


88 


454 


G00918 


Homo sapiens 


Human secreted protein, SEQ ID NO: 4999. 


263 


81 


455 


Y22634 


Homo sapiens 


Uiimnn rvtnkine inducible r^ulatOTV DTOtein-I 

(CIRP-l). 


192 


67 


456 


Y3o705 


Homo sapiras 


Praoment of human secreted Drotetn encoded by 
gene 62. 


106 


40 . 


457 


N91323 




DNA encoding human growth hormone -receptor. 


3282 


"96 


ACQ 

458 




riasmoQiuin 
falciparum 


i^-fintipfin nrecursor 


110 


36 


459 


jiiili 


UnmA CQfilmc 


Amino acid seQuence of protein PR0257. 


509 


98 


460 




Jlumo aopicus 


Human secreted protein encoded by gene 44 
clone HTDAD22. 


149 


43 


40 J 




XIWUV 5ll|llwUd 


Fragment of human secreted protein encoded by 
gene 17. 


184 


54 




Y530O5 


Homo sapiens 


Human secreted protein clone pm749_8 protein 
sequence SEQ ID N0:16. 


135 


47 




X84dd) ' ■ 


Triticum 
aestivum 


low molecular weight glutenin 


109 


33 


464 


W19919 


Homo s^iens 


Human Ksr-1 (kinase suppressor of Ras). 


1781 


85 


465 


AF189764 


Mus 

musculus 


alpha/beta hydrolase- 1 


502 


jy 


466 


U93569 


Homo sapiens 


p40 


101 


30 


467 


Y41528 


Homo sapiens 


Fragment of human secreted protein encoded by 
gene 77.. 


1172 


99 


468 


G02872 


Homo sapiens 


Human secreted protein. SEQ ID NO: 6953. 


149 


52 


469 


AJ000008 


Homo sapiens 


PI3^asc 


5832 


97 


470 


X70922 


Mus 

musculus 


neurotoxin homologue 


118 


47 


471 


G03797 


Homo sapiens 


" "Human secreted protein. SEQ ID NO: 7878. 


198 


75 


472 


Y36705 


Homo sapiens 


Fragment of human secreted protein encoded by 


72 


57 



114 



wo 01/57188 



PCTAJSOl/03800 



SEQ 

ID 

NO: 


Accession 
No. 


Species 


Description 


Waterman 
Score 


% 

Identity 








gene 62. 






473 


G02313 


Homo sapiens 


Human secreted protein, Stg ID NO: 6394. 


328 


100 


474 


Y07007 


Homo sapiens 


Breast cancer associated antigen precursor 
sequence. 


inn 


97 


475 


W93254 


Homo sapiens 


Human ESRPl protein. 

Human breast cancer relatea proiem iJ<^Kt>^. 


943 


80 
65 


477 


W48351 
Y02693 


Homo sapiens 
Homo sapiens 


Human secreted protein encoded by gene 44 
clone HTDAD22. 


202 


60 


478 


GDI 870 


Homo sapiens 


Human secreted protein, siai^ VJ jnu: oy^i. 






479 


AF102777 


Mus 

musculus 


FYVE finger-containing phosphomositide kinase 


3427 


92 


480 


G03052 


Homo sapiens 


Human secreted protein. SEQ ID NO: 7133. 


123 


53 


481 


Wo7 /Ul 


Ifnmn can i ens 


A human membrane fusion protein designated 
SYTAXl. 


221 


77 


482 


G03119 


Homo sapiens 


Human secreted protein, SJby lu nu: /zuu. 


1.^ 1 
124 


'^Q 
J7 

59 


483 
484 


AF2 10651 
AF010144 


Homo sapiens 
Homo sapiens 


NAG18 

neuronal thread protein AD7c-NTP 


343 




485 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 4718. 


129 


70 


486 


U15174 


Homo sapiens 


BCL2/adenovirus ElB I91cl>-interacting protein 
% 


149 


73 


487 
488 


Y76167 
AJ275213 


Homo sapiens 
Homo sapiens 


Human secreted protein encoded by gene 44. 
stabiiin-1 


627 

"1243 


100 
91 


489 
490 


G03798 
L12392 


Homo sapiens 
Homo sapiens 


Unman c^rrfie-A TirnteirL SEO ID NO: 7879. 

Huntington's Disease protein 

Human secreted protein, SKg ID NO: 7870. 


313 

16081 

197 


65 

100 

^ 


491 
492 


G03789 
J03799 


Homo sapiens 
Homo sapiens 


laminin-binding protein 


228 


70 


493 


U15174 


Homo sapiens 


BCI/2/adcnovirusElB l91cD-interacting protein 
3 


128 


41 


494 


Y02693 


Homo sapiens 


Human secreted protein encoded by gene 44 
clone HTDAD22. 


197 

- 


67 


495 


AC005175 


Homo sapiens 


R31449 3 

Human secreted protein, SEQ ID NO: 7867. 


889 
229 


y4 

Ol 


496 
497 


G03786 
AB030237 


Homo sapiens 

Canis 

familiaris 


D4 dopamine receptor 

Human secreted protein, SEQ ID NO: 6953. 


on 
90 

228 


0-) 


498 
499 

500 


G02872 
U70935 

U48508 


Homo sapiens 
Peromyscus 
maniculatus 
Homo sapiens 


reverse transcriptase 

skeletal muscle ryanodine receptor 


213 
26406 


52 
99 


501 

502 


G03371 
AFn9851 


Homo sapiens 
Homo sapiens 


Human secreted protein, SEQ ID NO: 7452. 
PR01722 


105 
156 


DO 


503 


AFl 13685 


Homo sapiens 


PRO0974 


1 lb 


<n 

DU 


504 


U79458 


Homo sapiens 


WW domain binding protein-2 


322 


Dy 


505 


W29651 


Homo sapiens 


Human secreted protein CD124 3. 


oOo 




506 


W85459 


Homo sapiens 


Secreted protein encoded by clone dhl 135_9. 


986 
"TT« " 


70 


507 


Y86265 


Homo sapiens 


Human secreted protein HUSXE77, SEQ ID 
NO:180. 


Wo 




508 


AL160175 


Homo sapiens 


bA243J163 (similar to MYLK (myosm, ligW 
polypeptide kinase)) 


lo4 




509 


U43360 


Peromyscus 
maniculatus 


reverse transcriptase 


y 1 


62 


510 


G03789 


Homo sapiens 


Human secreted protem, obQ uj nu. /b/u. 


1 17 


63 


511 


W79092 


Homo sapiens 


Human secreted protein dn740 3. 


1058 


100 


512 • 


ArUlUl*l4 




neuronal thread protein AD7c-N 11' 


205 


64 


513 


AJ 133439 


Homo sapiens 


GRIPl protein 


2151 


100 


514 


AE003456 


Drosophila 
melanogaster 


CG6393 gene product 


259 


42 . 


515 


Z17206 


Xenopus 
laevis 


p46Xl£g|22 


128 


40 


516 


AF104413 


Homo sapiens 


large tumor suppressor 1 


1766 


94 


517 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7878. 


92 


40 


518 • 


AF151083 


Homo sapiens 


HSPC249 


444 


98 


519 


S80864 


Homo sapiens 


cytochrome o-like polypeptide 


318 


50 


520 


X92485 


Plasmodium 
vivax 


pval 


170 


61 



115 



wo 01/57188 



PCTAJSOl/03800 



SEQ 
ID 

NO: 


Accession 
No. 


Species 


Description 


OlUlUl' 

waiciman^ 
Score 


At 

lucnuiy 


521 


G03790 


Homo sapiens 


Human secreted protem, SEQ ID NU: /o/i- 




so 


522 


AF121857 


Homo sapiens 


sorting nexin 7 




An 


523 


G02654 . 


Homo sapiens 


Human secreted protein, SKQ lU MU: o/JD. 


S9 


Si 


524 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 cloEe 

HPMBQ32. 


£jO 




525 


AFl 19851 


Homo sapiens 


PRO 1722 


lOZ 


J 1 


526 


Y27761 


Homo sapiens 


Human secreted protein encoded by gene No. 47. 


1 J*t 


Di 


527 


G02707 


Homo sapiens 


Human secreted protein, SKQ lu NU: e 


/U 


«*J 


528 


U47924 


Homo sapiens 


C8 


1110 


60 


529 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 8144. 


84 


45 


530 


G03203 


Homo sapiens 


Human secreted protein. SEQ ID NO: 7284. 


111 


60 


531 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 8148. 


92 


65 


532 


G03267 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7348. 


75 


29 


533 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7284. 


182 


48 


534 


AF068286 


Homo sapiens 


HDCMD38P 


861 


100 


535 


U07707 


Homo sapiens 


epidermal growth factor receptor substrate 


228 


60 


536 


GO 1955 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6036. 


484 


75 


537 


AF219232 


Gallus gallus 


q in-induced kinase 


206 


53 


538 


AFl 35022 


Homo sapiens 


mediator 


128 


100 


539 


G03267 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7348. 


141 


59 


540 


AF016430 


Caenoihabditi 
s elcgans 


contains similarity to a BR-CTl i aomain 


853 


39 


541 


AC003093 


Homo sapiens 


' OXYSTEROL-BINDING PROTEIN; 45% 
similarity to P22059-(PID:gl29308) 


408 


"66 


"542 


M29487 


Homo sapiens 


integrin alpha subunit precursor 


517 


81 




AFl 02530 


Mus 

musculus 


olfactory receptor F3 


327 


73 


544 


Y73431 


Homo ss^iens 


Human secreted protein clone ybl86_l protein 
sequence SEQ ID NO:84. 


386 


100 


545 


AE004833 


Pseudomonas 
aeruginosa 


probable TonB-dcpendent receptor 


279 


42 


546 


003793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7874. 


264 


53 


547 


Y69192 


Homo sapiens 


A human monocyte-macrophage apolipoprotein 
B receptor protein. 


1772 


67 


548 ' 


Y91493 


Homo sapiens 


Human secreted protein sequence encoded by 
gene43SEQIDNO:166. 


176 


100 


549 


G01571 


Homo sapiens 


Human secreted protein, SEQ ID NO: 5652. 


777 


99 


550 


AF044588 


Homo sapiens 


protein regulating cytokinesis I; PRCl 


1953 


88 


551 


Y29332 


Homo sapiens 


Human secttted protein done pe584_2 protein 
sequence. 


1224 


94 


552 


X98330 


Homo sapiens 


ryanodine receptor 2 


24621 


99 


553 


Y42782 


Homo sapiens 


Human UC Band ^331 protein. 


684 


95 


554 


AB025258 


Mus 

musculus 


gninuphilin-a 


501 


41 


555 


AJ010346 


Homo sapiens 


RING-H2 


1468 


100 


556 


W92388 


Homo sapiens 


Human TR-interacting protein S239a. 


538 


92 


557 


AF119851 


Homo sapiens 


PR01722 


175 


59 


558 


AFl 17756 


Homo sapiens 


thyroid hormone receptor-associated protein 
complex component TRAP! 50 


183 


32 


559 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6953. 


319 


68 


560 


D86214 


Mus 

musculus 


Ca2+ dependent activator protein for secretion 


1010 


93 


561 


AF187325 


Canis 
familiaris 


melanoma antigen 


"37 


55 


562 


AJ001981 


Homo sapiens 


6XA1L 


2512 


99 


563 


Z17238 


Rattus 
norvegicus 


glutamate receptor sub^e delta-1 


338 


66 


564 


W30638 


Homo sapiens 


Partial human 7-transmembTane receptor 
HAP0167 protein. 


371 

467 1 


100 


565 


AC005620 


Homo sapiens 


R33590 1 




97 


566 


Y99358 


Homo sapiens 


Human PR01772 (UNQ834) amino add 
sequence SEQ ID N0:63. 


1138 


78 


567 


AL031177 


Homo sapiens 


dJ889M153 (novel protem) 


1002 


58 


568 1 


AF15HW3 




HSPC209 


798 


100 



116 



wo 01/57188 



PCT/USOl/03800 



SEQ 
ID 

NO: 



569 



570 



571 



572 



573 



574 



Accession 
No. 



AF097518 



AB035698 



Y07096 



AU)31177 



YSSST 



AB037108 



D43949 



Species 



description 



Homo sapiens 



iver-specific transporter 



Homo sapiens 



Slisshapen/NIK-related kinase MINK-l 



Homo sapiens 



Colon cancer associated antigen precursor 
sequence. 



Homo sapiens 
Homo sapiens 



(1J889M15.3 (novel protein) 



'' vlembrane-bound protein PRO290. 



Homo sapiens 



Homo sapiens 



Smith- 

Waterman 

Score 



seven transmembrane domain orphan receptor 



This gene is novel. 



231 



1532 



1064 



735 



254 



1883 



836 



576 



577 
578 



Y48596 



Homosapiens 



iuman breast tumour-associated protein 57, 



G00352 



Homo sapiens 



Anmm secreted protein, SbQ lO NO: 4433. 



T4r 



R95913 



Homo sapiens 



' t^cural thread protein. 



AK025n6 



Homo sapiens 



unnamed protein product 



201 
77 



580 



581 
582 
583 



584 



585 



586 



587 



588 
589 



y86473 



Homo sapiens 



■luman gene 52-encoded protein fragment, SEQ 
IDNO:388^ ; 



AF196779 
AF 1 88706 
AB03O234 



Homo sapiens 
Homo sapiens 
Canis 
familiaris 



JMIO protein 
g20 protein 
D4 dopamine receptor 



450 
330 
64 



G02621 



Homo sapiens 



Human secreted protein, SEQ iD NO: 6702. 



345 



AL096828 



¥30819 



G00357 



Homo sapiens 



dJ963E22.1 (Novel protein similar to NY-REN-2 
Antigen) 



268 



Homo sapiens 



Human secreted protein encoded from gene 9. 



23r 



Homo sapiens 



Human secreted protein, SEQ ID NO: 4438, 



132 



Homo sapiens 



Human secreted. protein, SEQ ID NO: 6953. 



182 



AF235017 



Mus 

musculus 



2P1 protein 



590 



591 



592 



593 



594 
595 



W88627 



Homo sapiens 



Secreted protein encoded by gene 94 clone 
HPMBQ32. 



Y30709 



Homo sapiens 



Amino acid sequence of a human secreted 



no 



protem 



Y53875 



Homo sapiens 



Y53051 



Homo sapiens 



A human seven transmembrane signal transducer 
polyp€ptid< 



1369 



Human secreted protein clone ddl 19_4 protein 
sequence SEQIDNO:108. 



1112 



Y27658 



Homo sapiens 



Human secreted protein encoded by gene No. 92. 



763 
156 



596 



G03798 



Homo sapiens 



Human secreted protein, SEQ lU NU: /» /y. 



AF15inO 



Mus 

musculus 



CO?l protein 



2215 



597 



598" 
600- 



G03786 



Homo sapiens 



Human secreted protein, ^EQ ID NO: 78677 



AFl 92499 

APn985 5 
GQ2872 



Y00295 



Mus 

musculus 
Homo sapiens 
Homo sapiens 



putative secreted protein ZS1G37 

PR01847 

Human secreted protein, SEQ ID NO: 6953. 



157 
143 



Homo sapiens 



Human secreted protein encoded by gene 38. 



567 



602 



603 



AF184971 



AF061936 



Homo sapiens 



class II cytokine receptor ZCYT0R7 



Homo sapiens 



diacylglycerol kinase iota 



773 
T33r 



609 



610 



AL096628 



Homo sapiens 



dJ963E22.1 (Novel protein 
Antigen) 



similar to NY-KEN-2 




W88627 



Homo salens 



Secreted protein encoded by gene 94 clone 
HPMBQ32. 



Y27868 



Homo sapiens 



Human secreted protem encoded by gene Na 



339 



107. 



AF202636 



Homo sapiens 



angiopoietin-like protein PPl 158 



612 
613 



Homos 



PRO0663 



Y02693 



Homo sapiens 



Human secreted protein encoded by gene 44 
clone HTDAD22. 



614 



M87053 

AC004232 
GOl 



Katms 
norvegicus 
Homo sapiens 
Homo sapiens 



lens membrane protein 

FPM315 

Human secreted protein, g 



> ID NO: 6065. 



450 

TST 

205 



117 



wo 01/57188 



PCTAJSOl/03800 



r 

SEQ 

ID 

NO: 


Accession 
No. 


-H 1 r 

SpeciGS 


r^^t'^f inti nn 
l,/65UIipilD[l 


Smitti- 

Waterman 

Score 


% 

Identity 


617 


Y91524 


Homo sapiens 


Human secreted protein sequence encoded by 
gene74SEQIDNO:197. 


821 


"99 


618 


AJ245621 


Homo sapiens 


Ul Li protein 


2258 


99 


619 


Y76198 


Homo sapiens 


TJinrtan K»fr^f^A nmt^tn ^ncfldcd Iw ffCDC 

rjuiUHn scvicicu jjiuicui viiw/uw* vj gvuw tt^m 


108 


64 


620 


AF067864 


Homo sapiens 


''lit 1 111 M«%An^rM* ^ ainho 

transienin iccepior z aipna 


3922 


94 


621 


D90721 


Escherichia 
co]i 


T_ 11 m.ii.m.i i.i'U lU MVA^Mn AwfTn^ 

iXtUlSmCuUHauv praiCUi oppVif 


573 


90 


622 


W75858 


Homo sapiens 


tiuman secretory proiein oi cionc wo / jtr-j. 


730 


100 


623 


Y94982 


Homo sapiens 


Human secreted protein vbl2 1, SEQ ID N0:4. 


733 


100 


624 


AF034745 


Mus 

musculus 


LNXpSO 


637 


83 


625 


U42580 


Paiameciam 
bursaria 
Chlorella 
virus 1 


Pro-nch, IPrPNMSLrLo \^X) 


9d 


46 


626 


U79260 


Homo sapiens 


unknown 


"194 


70 


627 


R95913 


Homo sapiens 


Neural thread protein. 


99 


50 


628 


G03450 


Homo sapiens 


Human secreted protein, biJbg lu Mu: oil. 


AO*/ 


100 


629 


Y36281 


Homo sapiens 


Human secreted protein encoded by gene 58. 


jyU 


inn 


630 


Y02693 


Homo sapiens 


Human secreted protein encoded by gene 44 
clone HTDAD22. 


lOD 


to 


631 


G02139 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6220. 


268 


yo 


632 


U16996 


Homo sapiens 


protein tyrosine posphalase 


i31 


ov 


633 


AF121857 


Homo sapiens 


sorting nexin 7 


2019 


100 


634 


AF283772 


Homo sapiens 


similar to Homo sapiens ril)osomal protem Liu 
encoded by GenBanIc Accession Number 
L25899 


340 


/ / 


635 


Y07090 


Homo sapiens 


Renal cancer associated antigen precursor 
sequence. 


277 


"64 


636 


AB013382 


Homo sapiens 


DUSP6 


414 


lO 


637 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO; 6953. 


315 


71 


638 


M95762 


Rattus 
norvegicus 


GABA transporter 


5'Z4 


oy 


639 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: /57U. 


LVy 


Ow 


640 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 clone 
HNHF029. 


1 1f 


70 


641 


AC008075 


Arabidopsis 
thaliana 


F24J5.4 


191 


33 


642 


W74824 


Homo sapiens 


Human secreted protein encoded by gene 96 
clone HAQBJK.O] . 


615 


62 


643 


AB015982 


Homo sapiens 


serine/threonine kinase 


485 


98 


644 


Y25806 


Homo sapiens 


Human secreted protein Itagnient encoded from 
gene 7.5. 




46 


645 


AF122904 


Homo sapiens 


membrane protein DAPIO 


474 


100 


646 


AF233323 


Homo sapiens 


Fas-associated phosphatase- 1 


200 


38 


647 


W48804 


Homo sapiens 


Homo sapiens clone BK158 1 protein. 


1203 


99 


648 


AF257330 


Homo sapiens 


COBW-like protein 


1440 


98 


649 


Y36203 


Homo sapiens 


iiuman secreieu proiein » / 


233 


73 


650 


G02872 


Homo sapiens 


U.^MOM (MM«mfA/l npoftttn CPO \T\ "NO* #)0S^ 


173 


78 


651 


Y32199 


Homo sapiens 


Human receptor molecule (REC) encoded l>y 
incyte clone ivSLLt z?. 


1012 


100 


652 


AB0329O9 


Hylobates 
agilis 


dopamine receptor D4 


122 


32 


653 


AK021848 


Homo ssptens 


uimamed protein product 


186 


69 


654 


W73411 


Homo sapiens 


Human secreted protein encoded by Gene No. 
15. 


57 


37 


655 


L22455 


Rattus 
norvegicus 


mu opioid receptor 


116 


34 


656 


G03112 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7193. 


no 


45 


657 


G02345 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6426. 


459 


97 


658 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 clone 
HPMBQ32. 


291 


75 


659 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6913. 


134 


65 


660 


Y91423 


Homo sapiens 


Human secreted protein sequence encoded by 
gettenSEQIDNO:144. 


333 


96 



118 



wo 01/57188 



PCTAJSOl/03800 



SEQ 1 i 
ID h 
NO: 


\oc6saon : 


Species 


Description 

Untnan cprrftteri nrotcin. SEO IDNO: 7870. 


Waterman 

Score 

168 


Identity 

isi : 


661 1 

662 


303789 
Y53886 


rlomo si^iens 
Homo sapiens 


A suppressor of cytokine signalling protein 

Heqiffnnted HSCOP'6. 

Human OTP binding protein APD08. 


375 
629 


100 


663 
664 

665 


W75771 
AL096770 

AB037734 


Homo sapiens 
Homo sapiens 

Homo sapiens 


bA150A6.2 (novel 7 transmembrane receptor 
(rhodopsin family) (olfactoiy receptor like) 
protein (hs6Ml-2ni 

VT A A 1 ^ 1 nmtein 


480 

978 
192 


55 

96 
84 


666 

667 1 


W82841 
W82841 
AB030184 


Homo sapiens 
Homo sapiens 
Mus 

musculns 


Human cerebral protein-1. 

Human cerebral protein- 1. 

contains transmembrane (TM) region and ATP 

binding region 


182 
757 


87 
68 


669 


AB032919 


Hylobates 
muelleri 


dopamine receptor d4 


85 


37 
81 


0/U 1 

671 

672 

673 1 


Z33642 
W85608 
G03203 


Rattus 
norvcgicus 
Homo sapiens 
Homo sapiens 
Homo sapiens 


outer membrane protem 

leukocyte surface protein 

Secreted protein clone du410 5. 

Human secreted protein, Sbg iv mu: urn. 


746 

394 
261 

1 vu 


93 
91 
48 
"99 


674 
675 
676 
677 


AL035587 
Y59668 
G03797 
At'026954 


Homo sapiens 
Homo sapiens 
Homo sapiens 
Bos taurus 


dJ475N16.4(KIAA0240) . 
Secreted protein 108-005-5-0-Cl-FL. 
Human secreted protem, st^V^ lu n\Ji /o /o- 
pynivate dehydrogenase pnosphatase r^atoiy 
subunit precursor; PDPr 


2388 
1134 
174 
1013 


53 
74 
95 

"96 


678 


LI 1625 


Mus 

musculus 


receptor protein-tyrosine kinase 


545 
745 


100 


679 
680 

681 


AL031427 
AJ133430 

G02532 


Homo sapiens 
Mus 

musculus 
Homo sapiens 


dJ167A193 (novel protemj 

olfactory receptor 

Human secreted protein, SEO IDNO: 6613. 


528 
179 


77 

70 


682 
683 


G03789 
Y94943 


Homo sapiens 
Homo sapiens 


Human secreted protem, SEQ ID wu: /o /u. 
Human secreted protein clone ytl4_l protein 
sequence SEO ID NO:92. 


336 
US 


76 
100 


684 

685 


U43360 
G00885 


Peromyscus 
maniculatus 
Homo sapiens 
1^0010 sanicns 


reverse transcriptase 

Human secreted protein, SfcVj ID NO: 4966. 
imnamed protein product 


100 

T62 
590 


37 

"^0 
100 


686 
687 
688 


1 G01982 
Y92241 


Homo ss^cDS 
Homo sapiens 


Human secreted protein. SbO IDNO: 6063. 
Human cancer associated antigen precursor 
> (Ma-REN-46). ^ . 


718 
2405 


100 
99 


689 


AC024792 


Caenorhabditi 
s elegans 


contains similarity to TK:P7»J 


423 


36 


690 


Y27868 


Homo sapiens 


Human secreted protein encoded by gene wa 

107 


183 


81 

-58 — ~ 


691 

692 
693 


Y56514 

1 Y27795 
I Y36268 


Homo sapiens 

Homo sapiens 
Homo sapiens 


■ Human Jurkat cell clone P2-15A1M10 longest 
ORF protein sequence. 

" ~Tj,,_a- eor<rptAH nmtcin encoded bv sene No. 79. 

■ ■ Human secreted protem encoded by gene 45. 


T80 

1539 

428 

308 


"99 
98 
89 


694 
695 
696 
697 


|U124^S 
1 Y45272 
1 AF191838 
Y02693 


Homo sapiens 
Homo sapiens 
Homo sapiens 
Homo sapiens 


ribosomal protein L35 
' H""^»" '^prreted protein encoded &om gene 16. 

TANK binding kinase IBKI 
" Human secreted protein encoded by gene 44 

clone HTDAD22. 


1517 
1242 
275 


■ 99 

yo 
75 


698 


Y87280 


Homo sapiens 


' Human signal peptide containing protein HSPP- 
.57SEQIDNO:57. 


576 


"90 
"99 


699 
700 


Y97999 
AJ006701 


Homo sapiens 
Homo sapiens 


' ■ Human SCAD fiamiiy molecule HSFM-1, 5>lii^ 
IDNOrl. 

" putative serine/threonine protein kinase 


729 
610 


79 


701 
702 


1 AF209198 
AJ298841 


Homo s^iens 
Mus 

musculus 


zinc finger protein 277 ^ 

torsmA protein 

unnamed protein product 


2357 
709 

""622 


100 
45 

"98 


703 
704 

705 


AK021729 
Z46787 

1 002882 


Homo sapiens 
Caenorhabditi 
s elegans 
Homo sapiens 


similar to Glutaredoxin. Zinc finger. C3HC;4 

type gUNG finger) 

" Human secreted protein, S>bUiUJ4u: 6963. 


920 

1 589 


51 
198 



119 



wo 01/57188 



PCTAJSOl/03800 



SEQ 

ID 

NO: 



Accession 
No. 



Species 



Descripticni 



Smith- 
Waterman 
Score 



% 

Identity 



706 



707 



708 



G02301 



omo sapiens 



R9S326 



omo sapiens 



G03002 



Homo sapiens 



Y96202 



Homo sapiens 



Human secreted protein, SEQ IP NO: 65S2, 



Tumor necrosis factor receptor 1 dcalh domain 
ligand(clonc2PP). 



121 



Human secreted protein, SEQ lU NO: 7081 



125 



IkappaB kinase QKK) binding protein, y2H5fa. 



516 
131 



39 



98 
W 



710 



M63577 



Saccharomyc 
escereWsiae 



SFPT 



711 



712 



713 



714 



715 



AB026291 



Rattus 
noTvegicus 



acetoace^l-CoA synthetase 



D21211 



^omo sapiens 



protein tyrosine phosphatase (FTP-BAS, type 3) 



368 



AF044033 



Marmota 
marmota 



olfactory receptor 



615 



G03561 



Homo sapiens 



AB033062 



G00577 



Homo sapi^ 



Human secreted protein, SEQ ID NO: 7642. 
KIAA1236 protem" 



251 



1380 



Hiomo sapiens 



Human secreted protein, SEQ IP NO: 4658. 



80 



100 
100 



717 



718 
719 



Y96864 



Homo sapiens 



SEQ. ID. 37 firom WO0034474: 



AJ243396 



Homo sapiens 



voltage-gated sodium channel beta-3 subunit 



234 
578 



100 
99 



720 



721 



722 



723 



724 



725 



727 



728 



729 



730 



U47334 



Homo sapiens 



similar to chicken gamma aminobutyric aci 
receptor bcta4 subunit 



AB020598 



Homo sapiens 



Y53886 



Homo sapiens 



J05046 



Homo sapiens 



AF001958 



Arabystoma 
tigrinum 



AF127084 



Mus 

musculus 



X54673 



AF016191 



Homo sapiens 



Rattus 
norvegicus 



AB029559 



Rattus 
norvegicus 



Y28503 



Homo salens 



AJ011415 



Z93096 



Homo sapiens 



Homo sapiens 



peptide transporter 3 



A suppressor of cytokine signalling protein 
designated HSCOP-6. 



570 



insulin receptor-related receptor 



6787 



electrogenic Na+ bicarbonate cotransporter, 
NBC 



111 



semaphorin cytoplasmic domain-associated 
protein 3A 



325r 



GABA transporter 



potassium channel 



370 



BATT 



139 



HGFH3 Human Growth Factor Homologue 3. 



2186 



plexin-Bl/SEP receptor 

bK390B3.1 (manic fringe ^Prosophila) 
homolog) 



729 
142 



100 
74 



100 
41 



99 



100 



35 



68 



731 



Z10062 



Homo salens 



cDNA encoding a human vanilloid receptor 
homologue Vanilrepl. ^ 



732 



733 
734 



AF161382 



Homo sapiens 



HSPC264 



AB029033 



Homo sapiens 



KIAAl 110 protein 



3826 
592 



735 



736 



737 



738 



AE000493 



Esdicricbia 

GOU 



putative transport protem 



AL033379 



Homo sapiens 



dJ4 17022.2 (novel 7 transmembrane receptor 
(rhodopsin family) protein similar to high- 
affinity lysophosphatidic acid receptor homolog) 



2173 



AF132599 



Homo sapiens 



RANTES fartor of late activated T lymphocytes- 



245 



X55019 



Homo sapiens 



acetylcholine receptor delta subunit 



Sir 



X91906 



Homo sapiens 



voltage-gated chloride ion channel 



1978 



AB026116 



Homo sapiens 



organic anion transporter 4 



1444 
83 



99 



56 



100 



98 
24 



740 



742 
743 
744 



745 



746 



D00570 



Mus 
musculus 



open 

readmg frame (196 AA) 



W03626 
U66059 
AFl 19815 



Homo sapiens 
Homo sapiens 
Homo sapiens 
Homo 



Human thyrotropin GPR N-tcrminal sequence. 
V_segment translation product 

G-protein-coupled receptor 

haematopoietic lineage ceil protein (AA 1*486) 



118 
614 
2751 
148 



W67838 



Homo sapiens 



Human secreted protein encoded by gpnc 32 
clone HLTCJ63. 



448 



W57260 



W21578 



Homo sapiens 



Human semaphorin Y. 



Homo sapiens 



Alzheimer's disease protein encoded by DNA 



2414 
968 



40 

100 

99_ 

93_ 

95 



100 



65 




120 



wo 01/57188 



PCTA}S01/03800 



SEQ 

ID 

NO: 


Accession 
No. 


Species 




Smith- 

Watennan 

Score 


% 

Identity 


751 




Nlus 

musculus 


aranimh i li n -a 


773 


41 


752 


Y52386 


Homo sapiens 


Unman tran^embrane orotcin HP02000. 


900 


99 


753 


Y48586 


Homo sapiens 


Human breast tumour«associated protein 47. 


2527 


99 


754 


AJ272207 


Homo sapiens 




694 


100 


755 


M85183 


Rattus 
noivcgicus 


vasopressin iwcpwi 


979 


68 


756 


AF190501 


Homo sapiens 


recepror o 


388 


71 


757 


Y02692 


Homo sapiens 


Human secreted protein encoded by gene 43 
clone HTADX17. 


TSi 


87 


758 


Z22535 


Homo sapiens 


ALK.3 


439 


98 


759 


R04932 


Homo sapiens 


Interferon-gamma receptor segment from clone 
39 responsiDieior Dmaing ine uugeu 


564 


97 


760 


W74902 


Homo sapi&is 


Human secreted protein encoded by gene 175 
done ni^Diyl. 


1217 


99 


761 


G03706 


Homo sapiens 


Human secreted protein, oiiy lu //o/. 


223 


88 


762 


AB020676 


Homo sapiens 


KIAA0869 protein 


"4433 


"99 






Homo sapiens 


unnamed protein product 


2285 


99 


764 


AF173358 


Homo sapiens 


glucocorticoid receptor AF-1 coactivator-1 


573 


100 


765 


AF268066 


Mus 

musculus 


netrin4 

. — 


2019 


89 

"85 


766 
767 


Y48585 
AF230378 


Homo sapiens 
Mus 

musculus 


Human breast tumour-associated protein 40. 
interleokin-1 delta 


309 


45 


768 


AF121975 


Mus 

musculus 


odorant receptor S18 




"62 




An008515 


Homo sapiens 


RanBPM 


611 


57 


770 


Y09945 


Rattus 
norvegicus 


putative inl^gral membrane transport protein 


458 
688 


50 


• 771 
772 


AF226731 
Y27132 


Homo sapiens 
Homo sapiens 


AD026 

Human glioblastoma-denvca poiypepuae ^cionc 
OA004FG). 


1384 


100 


773 


X87832 


Homo sapiens 


NOV/plexin-Al protein 


1821 


98 


774 


AB0252S8 


Mus 

musculus 


granuphilin-a 


500 


41 


775 


AF12510I 


Homo sapiens 


HSPC040 protein 


232 


93 


776 


G02815 


Homo sapiens 


Human secreted protein, SEO ID NO: 6896. 


314 


95 


777 


G02493 


Homo sapiens 


Human secreted protein, ba^l inu. oj 
Sequence of pre-human atrial natriuretic peptide. 


191 
213 


68 
45 


778 
119 


R03301 
AL357374 


Homo sapiens 
Homo sapiens 


bA353C18.2 (novel protein) 


232 


100 


780 


AF100346 


Homo sapiens 


neuronal voltage gated calcium channel gamma- 
3 subunit 


1434 


89 


781 


Y19566 


Homo sapiens 


Amino acid sequence of a human secreted 
protein. 


103 


52 


782 


Y36233 


Homo sapiens 




1098 


93 


783 


AF084464 


Rattus 
norvegicus 




141 


30 


784 


W49042 


Homo sapiens 


Human low density lipoprotem bmdmg protein 
LBP-3. 


2693 


99 


785 


AF238381 


Homo sapiens 


PTOVl 


1904 


91 


786 


Y91870 


Homo sapiens 


Human apoptosis related protein. 


547 


100 


787 


Y71062 


Homo sapiens 


Human membrane transport protein, MTRP-7. 


1062 


94 


788 


AFU7754 


Homo sapiens 


thyroid homione receptor-associated protein 
complex component TRAP240 
dJ37C 10.3 (novel ATPase) 


8684 
"2848 


98 
96 


7*9- 
790 


AL049569 
AF151848 


Homo sapiens 
Homo sapiens 


CGI-90 protein 


745 


96 


791 


Y08639 


Homo sapiens 


nuclear oiphan receptor ROR-beta 


1421 


95 


792 


Y41706 


Homo sapiens 


Human PR0381 protein sequence. 


644 


99 


793 


AF121228 


Homo sapiens 


thyroid hormone receptor-associated protein 
complex component TRAP95 


1037 
124 


100 
62 


794 
795 

796 


G04072 
Y69384 

W40215 


Homo sapiens 
Homo sapiens 

Homo sapiens 


Human secreted protein, SEQ ID NO: 8153. 
Amino acid sequence ot a 14274 receptor 
protein. 

Human macrophage antigen. 


119 

1358 


100 
"99 



121 



wo 01/57188 



PCT/USOl/03800 



SEQ 
ID 
NO: 
797 


Accession 
No. 

AF258340 


Species 
Homo sapiens 


Description 

hepatocellular carcinoma-associated antigen 1 12 


Smith- 

Waubiiiiiui 

Score 

1151 

461 


Tdentiiv 

99 
98 


798 
799 
800 


AF159615 

Y59863 

W70459 


Homo sapiens 
Homo sapiens 
Homo sapiens 


-GF receptor activating protein 1 

ixmm normal uterus tissue derived protein Z(>. 

Human Tl-receptor Ugand HI splice variant 2. 


797 
1913 


99 
93 


801 
802 

803 

804 


L00073 
P92219 

X15357 
W64473 


Homo sapiens 
Homo sapiens 
(human) 
Homo sapiens 
Homo sapiens 


renm 

CRl protein. 

"ANP-A receptor preprotcin (AA -32 to 1029) 
Human secreted protein from clone EC 1 72 1 . 


11963 

5199 
4018 
2067 


97 

98 
95 
100 


805 
806 
807 
808 


AJ243474 
G01731 
Z24680 
AF171669 


Homo sapiens 
Homo sapiens 
Homo sapiens 
Homo sapiens 


oligophrenin-4 

Human secreted protein. SEQ ID NO: 5812. 
garp 

glyooprotem-associated amino acid transporter 
LAT2 

Secreted protein CC198 1. 


284 
1562 
1364 

1154 


100 

83 

90 

"96 


809 
810 

811 


W70321 
W74843 

AF108831 


Homo sapiens 
Homo sapiens 

Homo sapiens 


Human secreted proteb encoded by gene 1 15 
clone H0VBAd3. 
K:CI cotransporter 3 


855 

4561 


"99 
100 


812 
813 

Olt 


AF092135 
AF283772 

G01563 


Homo sapiens 
Homo sapiens 

Homo sapiens 


PTO014 

similar to Homo sapiens ribosomal protein LIO 
encoded by GenBank Accession Number 
L25899 

Human secreted protein, SEQ ID NO: 5644. 


862 
784 

330 


100 
100 

100 


815 
816 

817 
818 
819 


AF051151 
W95630 

G01082 

AF151800 

L00352 


Homo sapiens 
Homo sapiens 

Homo sapiens 
Homo sapiens 
Homo sapiens 


Toll/interleukin-l receptor-like protein 3 
Homo sapiens secreted protein gene clone 
gnll4 1. 

Human secreted protein, SBO ID NO: 5163. 
CGI-41 protein 

low density lipoprotein receptor 


3850 
358 

549 

1106 

3980 


99 
100 

100 

95 

100 


820 
821 
822 
823 


X04434 
G03844 
AF212220 
Y50125 


Homo sapiens 
Homo sapiens 
Homo sapiens 
Homo sapiens 


IGF-I receptor 

Human secreted protein, SEQ ID NO: 7925. 
TERA 

Human glycophosphatidylinositol-anchorcd 
protein GPI-122. 


5832 
572 
396 
4897 

2675 


^9 
100 
48 
99 

98 


824 
825 


AF156778 
AF096322 


Homo sapiens 
Homo sapiens 


ASB-3 protein 

neuronal voltage-gated calcium channel gamma- 
2 subunit 


1105 


100 


826 
827 


Y07972 
AB032013 


Homo sapiens 
Homo sapiens 


Human secreted protein fragment #2 encoded 
from gene 28. 
poi&ssiuin cnannci rwYo.i 


1540 
2435 


100 
95 


828 

829 

830 


Y13620 
Y91474 

X54232 


Homo sapiens 
Homo sapiens 

Homo sapiens 


BCL9 

Human secreted protein sequence encoded by 

gene24SEOIDNO:147. 

glypican 


5284 
541 

1625 


96 
98 

87 


831 
832 
833 


XI 4830 
Y71262 . 
G03873 


Homo sapiens 
Homo sapiens 
Homo sapiens 


acetylcholine receptor beta-subunit preprotein 
Human cbondromodulin-like protein, Zchml. 
Human secnsted protein, SEQ ID NO: 7954. 
R29828 1 


2540 
1002 
638 
1389 


100 
100 
96 
93 


834 
835 
836 


AC003030 

Y38422 

U41557 


Homo sapiens 
Homo sapiens 
Caenorhabditi 
3 elegans 


Human secreted protein, 
glycine-rich 


■ 964 
85 


87 
36 


837 

838 
839 


AL121889 

AJ011415 
W80398 


Homo sapiens 

Homo sapiens 
Homo sapiens 


dJ1076E17.1 (KIAA0823 protein (continues in 

AL023803)) 

plexin-Bl/SEP receptor 

A secreted protein encoded by clone cwl543 3. 
Human secreted protein, SEQ ID NO: 4943. 


998 

1580 
1105 

255 


75 

oO 
67 
92 


840 
841 
842 
843 

844 
845 
846 
847 


G00862 
G02650 
AF036717 
Y73446 

G02872 
AF151810 
X83378 
AO004883 


Homo sapiens 
Homo s£q)iens 
Homo sapiens 
Homo sapiens 

Homo sapiens 
Homo sapiens 
Homo sapiens 
Homo sapiens 


" Human secreted protein, SEQ ID NO: 673 1 . 
FGFR signalling adaptor SNT-1 
Human secreted protein clone yc27_l protein 
sequence SEQ ID N0:1 14. 
Human secreted protein. SEQ ID NO: 6953. 
CGI-52 protein 
putative chloride channel 
similar to general transcription fector 21; similar 


644 

2629 

1089 

357 
1443 
1620 
■ 655 


97 
99 
100 

69 
88 
99 
■ 96 



122 



wo 01/57188 



PCTAJSOl/03800 



SEQ 
IL) 

NO: 


Accession 
Kin ■ 


Species 




Smith- 

Waterman 

Score 


% 

Identity 








to AF038Q69 fPID '22827207^ 












mtinne^Mt^ rhpmntflctic Drotein-2 


160 


76 


849 




Homo S8.piens 


Qimilnr tn mfttiv olfflctoA' receotOF 13l Similar to 
P34984 n^ID£464305) 


963 


98 


o50 




lioino Sapiens 


C\ nrntpin_r»min1pH fecentor C5L.2 


1767 


100 


851 


AF 124490 


Homo sapiens 


ARF GTPase-activating protein GUI 


3415 


98 


852 


186217 


Homo sapiens 


Unman e»/>n»*<»H nnitpm T4WHniI54 SEO TD 

tiuman secrcica proiwij iiTf ffi\ju«rtj, uuv^ 
Vi\J, uz. 


1189 


99 


853 


AF224741 


Homo sapiens 


ctiloride channel protein 7 


3748 


99 


854 


XI 7094 


Homo sapiens 


lurm (AA i-7y4j 




00 


855 


W78245 


Homo sapiens 


Fragment ot numan sccreiea proiem cucoaca oy 
gene 19. 




00 


856 


R97569 


Homo sapiens 


I nterjeuKm-2 receptor associaxea proiem p43 . 




inn 


857 


Y41765 


Homo sapiens 


Human PRO1083 protein sequence. 




00 

77 


858 


AF057306 


Homo s^iens 


transmembrane proteolipid 


481 


84 


859 


AK025116 


Homo sapiens 


unnamed protein product 




07 


860 


Y413I2 


Homo sapiens 


Human secreted protein encoded by gene 5 clone 
HLDRM43. 


824 


^ nn 
lUU 


862 ' 


Y25776 


Homo sapiens 


Human secreted protein encoded irom gene 66. 


895 




863 


Y74188 


Homo sapiens 


Human prostate tumor EST fragment derived 
protein #375. 


96 


30 


864 


AF167473 


Homo sapiens 


heme-binding protein 


870 


yy 


865 


G02532 


Homo sapiens 


Human secreted protein, SJiQ ID NU; bbJ3. 


211 


0/ 


866 


X54870 


Homo sapiens 


Type n integral membrane protein 


12U1 


lUU 


867 


G00700 


Homo sapiens 


Human secreted protein, SEQ ID NO: 4781. 


o40 


77 


868 


Y07894 


Homo sapiens 


Human secreted protein fragment encoded from 
gene 43. 


ioo 


SSI 


869 


J00123 


Homo sapiens 


prcprocnkcphalin ( 


134y 


yj 


870 


Y91632 


Homo sapiens 


Human secreted protein sequence encoded by 
gene 25 SEQ ID NO:305. 


1048 


98 


871 


L04311 


Homo sapiens 


GABA-alpha receptor beta-3 subunit 




07 

yj 


872 * 


Y29988 


Homo sapiens 


Human cytokine family member EF-7 protein. 




QA 

y*t 


873 


AF161382 


Homo sapiens 


HSPC264 


1 1 1/1 
i \IA 


00 
77 


874 


G03412 


Homo sapiens 


Human secreted protem, SEQ ID NO: 7493. 


HD4- 


inn 


875 


Y27572 


Homo sapiens 


Human secreted protein encoded by gene No. 6. 


573 


96 


876 


Ml 5530 


Homo sapiens 


B^ccll growth factor 


171 


DO 


877 


W63681 


Homo sapiens 


Human secreted protein 1. 


1652 


OO 

yy 


878 


L27867 


Rattus 
norvegicus 


neurexophilin 


1 AAQ 


09 

yo 


879 


Y10835 


Homo sapiens 


Amino acid sequence of a human secreted 
protein. ■ 


321 


100 


880 


W88991 


Homo sapiens 


Polypeptide fragment encoded by gene 144. 




100 


881 


AF 118670 


Homo sapiens 


orphan sj protem-^upiea recepior 


1071 


100 


882 


AF208865 


Homo sapiens 


EDRf 




100 


883 


Y18462 


Homo sapiens 


cathepsin L 


209 


72 


8S4 


Y94y50 


Homo sapiens 


Jnuman secreica piuivui i»iuuc uhav/ j_i*> |iiuiwiu 
sequence SEQ ID NO:106. 


348 


too 


885 




Homo sapiens 




404 


100 


886 


Y04315 


Homo sapiens 


Uiimati earwtwH nimtl^in MlfWl^H ViV OMld V\ 

iiuman SCCrciCO pxuicin cijiajucu uy gwiic 


385 


too 


887 


X92744 






375 


100 


888 




Homo si^iens 


UiMvton B^t*wtf^ nmtpin cMiMMicfi clraifi 
rTtllTP"" SvvFwlwu proiciu si^jumiim vivu» 

rn^71 a 


994 


94 


889 


Y41293 


Homo sapiens 


Human soluble protein ZTMPO-l . 


4595 • 


99 


890 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7795. 


147 


63 


891 


AF208856 


Homo sapiens 


BM-014 


1012 


99 


892 


U29195 


Homo sapiens 


neuronal pentraxin 11 


2002 


98 


893 


X6S149 


Homo sapiens 


Burkitt lymphoma receptor 1 


1953 


100 


894 


Y949U 


Homo sapiens 


Human secreted protein clone pw337_6 protein 
sequence SEQ ID NO:34. 


537 


100 


895 


W61630 


Homo sapiens 


Clone HNFGW06 of EGFR receptor family. 


326 


63 


896 


M24110 


Homo sapiens 


G0S19-2 peptide precursor 


481 


100 


897 


Z68747 


Homo sapiens 


imogen 38 


2018 


99 


898 


AF186112 


Homo sapiens 


neuiokimn B-Iike protein ZNEUROKl 


619 


100 


899 


AF22S420 


Homo sapiens 


AD025 


734 


100 



123 



wo 01/57188 



PCTAJSOl/03800 



SEQ 

ID 

NO: 

900 

901 

QfYJ 

903 


Accession 
No. 

P60657 
M27288 

G01349 


Species 

iomo sapiens 
Homo sapiens 
Homo sapiens 
Homo sapiens 


Desdiption 

Sequence of human lipocortin. 
oncostatinM 

Polypeptide witli transmembrane domain. 
Human secreted protein, Sbg ID NO: 5430. • 
Human secreted protein encoded by gene 4. 


Smith- 
Waterman . 
Score 
1835 
1297 
749 
650 
1133 


% 

Identity 

100 

99 

100 

99 

99 


904 
905 
906 
907 
908 
909 


Y00261 

AF039688 

AB007836 

AB017507 

AK000056 

Y86299 


Homo sapiens 
Homo sapiens 
Homo sapiens 

Homo sapiens 
Honio sapiens 


antigen NY-CO-3 
iic-5 
Ai)el2 

unnamed protein product 
"Human secreted protein khOXBii, SEQ ID 
NO:214. 


771 

2544 

224 

1537 

427 

7393 


99 

100 

100 

98 

100 

99 


910 
911 


AF231023 
Y14134 


Homo sapiens 
Homo sapiens 


protocadherin Flamingo 1 

Vascular endothelial cell growth inhibitor beta 

protein sequence. 


1319 


100 


912 

913 
914 
915 


Z90420 

Y19757 
G03172 
U14971 


Homo sapiens 

Homo sapiens 
Homo sapiens 
Homo sapiens 


Human GDF-3"(hGDF-3) polypeptide encodmg 
•cDNA. 

■ SEO ID NO 475 from W099222«. 

Human secrMco proiem, scrv lu nv. /^jj- 

ribosomal protein S9 


1950 

1361 
112 

886 


100 

100 

48 

90 


916 
917 
918 
919 

920 
921 
922 
923 
924 


AF172854 
AC005525 
AF166350 
Y87285 

Y36131 
AFl 93766 
Y95013 
X75208 
Y96202 


Homo sapiens 
Homo sapiens 
Homo ssyiens 
Homo sapiens 

Homo sapiens 
Homo sapiens 
Homo sapiens 
Homo sapiens 
Homo sapiens 


cardiotrophin-like cytokme CLC 

F22162 1 
ST7 protein 

Human signal peptide containing protein HSPP- 

62SEQIDN0.62. 

Human secreted protein #3. 

cytokine-like protein CI 7 

Human secreted protein vc48_l, SEQ ID N0.66. 

protein tyrosine kinase-reccptor 

IkappaB kinase (IKK) binding protein, Y2H56. 


1204 
1963 
4711 
430 

"465 
724 
357 
5256 
813 


99 
100 
99 
100 

88 

100 

100 

1 An 
lUO 

98 


925 
"926 
927 
928 
929 
930 

931 
932 


AB039886 

G03368 

Y48606 

Y36151 

AFl 10399 

AF210317 

Y73328 
G01959 


Homo sapiens 
Homo sapiens 
Homo sapiens 
Homo sapiens 
Homo sapiens 
Homo sapiens 

Homo sapiens 
Homo sapiens 


down-regulated in gastnc cancer 

Human secreted protein, SEQ ID NO: 7449. 

Human secreted protein #23. 

elongation factor Ts ^ 

facilitative glucose transporter fiumily member 
GLUT9 

uTRV/i rinnp Dfotein seouencc. 
Human secreted protein, SEO ID NO: 6040. 


785 

55 

539 

668 

1666 

2763 

931 
274 
1469 


78 

50 

100 

100 

100 

99 

100 
100 
100 


933 
934 
935 
936 


U47924 
G03827 
AB039371 
X56385 


Homo sapiens 
Homo sapiens 
Homo sapiens 
Cams 
familiaris 


Human secreted proteiiu SEQ ID NU: /yu». 

mitochondrial ABC transporter 3 

raoo 


529 
196 
"1054 


93 
63 
100 


937 


B08906 


Homo sapiens 


Human secreted protein sequence encoded by 
gene 16 SEQIDNO:63. 


117 
1064 


"44 


938 
939 . 


M13692 
Y53886 


Homo sapiens 
Homo sapiens 


alpha-1 acid glycoprotein precursor 

A suppressor of cytokine signalling protein 

He<:ifmated HSCOP-6. 


515 


42 


940 


Y16630 


Homo sapiens 


' Human Putative Adrenomedullin Receptor 
(PAR). 


1904 


■^9 


941 

942 


AC005102 
M12886 


Homo sapiens 
Homo sapiens 


small inducible cytokine subfiamily A member 
24 

T-cell receptor beta chain 


627 
1289 


""59 
81 


943 
944 


AF226046 
Y36078 


Homo sapiens 
Homo sapiens 


GK003 

Extended human secreted protein sequence, bbQ 
ID NO. 463. 


104y 

667 
565 


100 
100 


945 
946 


M22877 
W67869 


Homo sapiens 
Homo sapieas 


cytochrome c 

Human secreted protein encoded by gene 63 
clone HHGDB72. 


551 


93 


947 

948 
949 
950 


W67859 

W85726 

An42015 

G04075 


Homo salens 

Homo sapiens 
Homo sapiens 
Homo sapiens 


Human secreted protein encoded by gene 53 

clone HBMCL41. 

Novel protein (Clone BG33 7). 

cMDC II protein 

■ Human secreted protein, SliO ID NO: 8156. 


"183 

789 
4236 
■ 567 


100 

100 
100 
' 99 



124 



wo 01/57188 



PCTAJSOl/03800 



SEQ 

ID 

NO: 

951 

952 



953 



954 



955 



956 



957 



958 



959 



960 



961 
962 



963 



Accession 
No. 

AF 11064 5 
Y36111 



Species 



Homo sapiens 
Homo sapiens 



AB012109 



iomo sapiens 



AF246221 



Homo sapiens 



AF054986 



[omo sapiens 



W74726 



Homo sapiens 



Y27096 



AJ222967 



Homo sapiens 



Y53052 



G02694 



AF 15 185 5 
U26592 



AL050306 



AF0788S9 
AB020315 



Descnption 

candidate tumor suppressor p33 INGl homoiog 
■ sxtended human secreted protein sequence, SEQ 

ID NO. 496^^ ___ 



omo sapiens 



Homo sapiens 



Homo sapiens 
'Hlomo sapiens 



Homo sapiens 



APCIO 



transmembrane protein BRj 



" mutative transmembrane GTPase 



' tiuman secreted protein fg949J^ 

. , ^ t^i^/Ai 



Human viral receptor protein (ACVRP). 



cystinosin . — 

Human secreted protein clone dtZOZ J protem 
SEQIDNO:110. 



sequence — ^ t-«<— 

Human secreted protein. SEQ ID NO: 6775. 



CGI-97 protein 

diabetes mellitustype I autoantigen 



dI475B7.2 (novel protein) 



PTD004 

homologue of mouse dkk-1 gene:Acc 
AF030433 



Smith- 
Waterman 
Score 
1314 



990 



1405 



1883 
1879 



1581 
1920 



35r 



283 



1214 
250 



2089 
l46r 



% 

Identity 



100 
100 



100 
100 



100 
100 



100 



96 
65 
100 



100 
100 



1 964; 
965 



966 



967 



jomo s^iens 



X04571 



Homo sapiens 



AF146019^ 
10 



aomo sapiens 
[omo sapiens 



precursor poivpeptidc (AA -22 to 1185) 



6580 



h epatocellular carcinoma antigen gene 520 
minK-rclated peptide 1; MiRPl 



AB021227 
AF180920 



membrane-type-S matrix mctalloprotcinase 



3545 



99 



99 



100 
100 



969 
970 



971 
972 



973 



974 



975 



976 



977 



978 



Homo sapiens 



cyclin L ania-6a 



AF10536S 
AF083248 



AJ132429 



iomo sapiens 
iomo sapiens 



iomo sapiens 



K-Cl cotransporter KCC4 

ribosomal protein L26 homoiog 

hyperpolarization-activated cyclianucleotide 
gated cation channel hHCN4 



5621 
739 



6295 



W61619 
AF155100 



Homo sapiens 



rinTi e HTPEF86 of TM4SF superfamily. 
"zinc finger protein m-R^N-^i anngen 



454 



AF275948 



Homo sapiens 



ABCAl 



11763 



AB026891 



Homo sapiens 



AFl 17657 



Homo sapiens 



cystine/glutamate transporter 

thyroid hormone receptor-associated protein 

complex component TRAP80 

neural membrane protein 35; NMP35 



2552 



3348 



99 

100 

100 



100 
100 



99 
100 



99 



979 



980 



981 



982 



986 
987 



AF044201 
AFl 19297 



Rattus 
porvegicus 
Homo sapiens^ 



neuroendocrine-specific protein-like protein 1 



1170 



AFl 55652 



Homo sapiens 



W88499 



Homo sapiens 



Z56281 
AB026125 

T 



Homo sapiens 
Ho mo sapiens 
io sapiens 



potassium channel modulatory factor 
Human stomach carcinoma clone HP10412- 
encoded protein. 



1983 



1553 



AB02388 8 
W27291 



Homo sapiens 
Homo 



interferon regulatoiy factor 3 
ART-4 

Fragment o^ human secreted protem enco 
gene 1 7. 



b-chemokine receptor CCM" 

Human H1075-1 secreted protein 5' end. 



1895 

712 

22r 



92 



99 
99 



100 
100 
32 



988 



989 



990 



AFI33450 



Manduca 
sexta 



juvenile hormone esterase binding protem 



G03697 



Homo sapiens 



Human secreted protein, SEQ iu isu: njo^ 



194 



AF204159 



Homo sapiens 



potassium large conductance calcium-acuvaiea 
channel beta 3asubunit 



1486 



G02061 
AL031266 



Human secreted protein. SKQ lUNO: 6142. 



558 
327 



88 
100 



99 
40 



991 
992 



993 
994 
995 
996 



997 
998 



999 



Caenorhabditi 



VM106R.1 



Y66749 
G01246 
AF133845 
AFl 17756 



Homo sapiens 
Homo sapiens 
Homo sapiens 
Homo sapiens 



Membrane-bound protein PROl 124. " _ 
Human secreted protein, SEQ ID NO: 5327. 



conn 



W62066 
Y87173 



thyroid hormone receptor-associated protein 
complex component TRAP150 



Homo sapiens 
Homo sapiens 



V13379 



Homo sapiens 



Human stem cell antigen 2. 

Human secreted protein sequence SEQ ID 

NO:212. , 

Amino acid sequence oi protein PR0263 



Y95008 



Homo sapiens 



1001 



AF190167 



Human secreted protein vQ. 1, SbQ lu mu^ 
iociated protein SLP-2 



membrane t 



676 
1747 



93 
100 



99 



47 



125 



wo 01/57188 



PCT/USOl/03800 



SEQ 
NO: 


Accession 

ISO, 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identity 




nni7^d 




Human secreted protein, SEQ ID NO: 5315. 


398 


96 




W73420 


Homo saniens 


Human secreted protein encoded by Gene No. 
24. 


2150 


100 




Y19791 


Honio sspicns 


1 9kD SRP-protein (AA 1-144) 


742 


100 


inns 




Homo saoiens 


membrane protein 


642 


100 


1006 


X63745 


Homo sapiens 


KDEL receptor 


326 


98 


1007 




nomo diipiciis 


Extended human secreted protein sequence, SEQ 
TDNO 382 


824 


99 


lOOo 






rlnn&mine receotor D4 


92 


35 


lOOy 


VO 1 £.Qr\ 

yyioou 


UntY^A eat\ipnc 
XluniO St^ICIlD 


Human secreted protein se(}uence encoded by 
pene 81 SEC ID NO:353. 


1372 


99 


1010 


ATI o/;n< 


rioiiiu Sapiens 


HT^fMRId 1 ^novel orotein'^ 


825 


98 


lOlI 


(jUi73i 


Homo sapiens 


Human (lecreted orotein. SEO ID NO: 7814. 


379 


98 


1012 


Y17531 


Homo sapiens 


Human secreted protein clone BL205 14 protein. 


818 


97 


1013 


G00724 


Homo sapiens 


Unman e«t«*i^h>H nmtpin iri NO* 480^ 


462 


100 


1014 


AF288092 


TsJaegleria 
gruberi 


haem lyase 


114 


37 


1015 


AB045292 


Homo sapiens 


MoJ prOiCm 


3867 


99 


1016- 


X15940 


Homo sapiens 


ribosomal protein L31 (AA 1-125) 


644 


100 


1017 


Y94873 


Homo sapiens 


Human protein cione xtruzo^z. 


1876 


100 


1018 


AL024498 


Homo sapiens 


QJ417MI4.1 (novel proiein; 


589 


100 


1019 


X83425 


Homo sapiens 


Lutheran blood group glycoprotein 


3054 


"99 


1020 


W03516 


Homo sapiens 


Prostaglandin DP receptor. 


1864 


100 


1021 


G03960 


Homo sapiens 


Human secreteo protem, oca^ uj jnu. ouhi. 


398 


100 


1022 


Y91689 


Homo sapiens 


Human secreteo pioiem se(|uence encouca oy 
gene w osixi uj n\j.jw^. 


768 


100 


1023 


AE000660 


Homo sapiens 


hADV36Sl 


573 


100 


1024 


AF 132965 


Homo sapiens 


CajI-3 I proieiD 


1550 


100 


1025 


W92380 


Homo sapiens 


Human TR-interactiJig protein S103a. 


1466 


97 


1026 


R66278 


Homo s^iens 


Therapeutic polypeptide fiom glioblastoma cell 
line. 


ft30 


100 


1027 


X65614 


Homo sapiens 


SI OOP calcium-binding protein 


476 


100 


1028 


Y4174I 


Homo sapiens 


Human PRO704 protein sequence. 


1323 


100 


1029 


AJ00I014 


Homo sapiens 


RAMPl 


806 


100 


1030 • 


W63682 


Homo sapiens 


Human secreted protein 2. 


1354 




1031 


AK023007 


Homo s^iens 


unnamed protein product 


766 


100 


1032 


W97900 


Homo sapiens 


Human SR-BI class B scavenger. 


2672 


99 


1033 


Y82453 


Homo sapiens 


iiuman i sccrcioiy proicut scv ^ 


639 


99 


1034 


Y73473 


Homo sapiens 


Uiimati cffrrvktp/l nmt^in rlnnp vri17R 1 nrotcin 
cAniimci* 9PO in NO* 1 68 


752 


93 


1035 


Y86468 


Homo sapiens 


iJimian omf^ AR>raMvleH nfYitfiin fV&ement. S&O 
IDNO:383. 


96 


90 


1036 


U09813 


Homo sapiens 


mitochondrial ATP synthase subunit 9 precursor 


698 


100 


1037 


AJ 242832 


xiomo sapiens 


vaipcUli 


3699 


99 


1 AOO 






acetylcholine receptor epsilon subunit CHRNE 


2574 


100 


1039 


AJ242730 


Homo sapiens 


polyhomeotic 2 


1310 1 


100 


1040 ' 


At 109968 


MUS 

musculus 


hfiiHina nfntpin DPISR^P 

UlH A OllUlUlg JJlUICUl I 


1453 


SO 


1041 


X52563 


Bos taurus 


pcnnability increasing protein 


383 


29 


1042 


uOOioo 


Homo sapiens 


Miimfin cprrptnl nmtein SRO ID NO' 4449. 


75 


50 


1043 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6613. 


60 


53 


1044 


M94582 


Homo sapiens 


interleukin 8 receptor B 


1850 


100 


1045 


AL080239 


Homo sapiens 


bG256022.1 (similar to IGFALS (insulin-like 
grow^ factor binding protein, acid labile 

subunit)) 


1704 


50 


1046 


AF125101 


Homo sapiens 


HSPC040 protein 


580 


100 


1047 


W74809 


Homo sapiens 


Human secreted protein encoded by gene 8 1 
ckmoHMWDNSZ. 


176 


100 


1048 


AL022238 


Homo sapiens 


dJ1042K10.4 (novel protein) 


2201 


100 


1049 • 


W88667 


Homo sapiens 


Secreted protein encoded by gene 1 34 clone 
HAIBP89. 


1559 


99 


1050 


AF097518 


Homo sapiens 


liver-specific transporter 


2820 


100 



126 



wo 01/57188 



PCT/USOl/03800 



SEQ 

tt) 

NO: 


Accession 
No. 


Species 


Description 


Smith- 

Watennan 

Score 


70 

Identity 


1051 • 


W78324 


Homo sapiens 


Fragment of human secreted protein encoded by 
gene 81. 


1318 


98 


1052 


Y21851 


Homo sapiens 


H\iinan signal peptido-contianing protem (5IGP) 
(clone ID 2328134>. 


1643 


95 


1053 

1054 
1055 


AL163815 

Y76200 
AJ276567 


Arabidopsis 
thaliana 
Homo sapiens 
Homo sapiens 


putative protein 

Human secreted protein encoded by gene 77. 
TClO^DceRlioGTPasc 

Human secreted protein encoded by gene No. 54. 


661 

2oZ 

1160 

154 


"62 

inn 
100 
96 


IIII III 


Y27620 
D14530 
AF132000 
AL031778 

AF227135 

Y27575 

Z11697 


Homo sapiens 
Homo sapiens 
Homo sapiens 
Homo sapiens 

Homo sapiens 
Homo sapiens 
Homo sapiens 


ribosomal protein 

TADAl protein 

receptor (peripheral) (MBR, PBR, PBKS. BBP. 

T..MA>itAAtSn»-KtnHincr ntYltein)') LUCE DlOteUlJ 

candidate taste receptor T2R9 

Human secreted protein encoded by gene No. 9. 

putative transmembrane protrin 


745 

1132 

920 

134 
1392 
1088 
819 


100 
100 
100 

33 
100 
100 
100 


1063 
1064 
1065 
1066 
1067 


AF123757 

AF155135 

Y41674 

AJ250042 

Y36087 


Homo sapiens 
Homo sapiens 
Homo sapiens 
Homo sapiens 
Homo sapiens 


novel retinal pigment epithelial cell protein 
Human channei-relatea moiecuie iw-iuvx-a. 
Rab5 GDP/GTP exchange factor homologue 
Extended human secreted protein sequence, SEQ 
ID NO. 472. 


2932 
936 
2575 
770 


99 
99 
100 
83 


1068 


Y94959 


Homo sapiens 


Human secreted protein clone mc300_l protein 
sequence SEQ ID NO:124. 


301 


Too 


1069 

1070 
1071 
1072 
1073 


Y94959 

W64535 
X03145 
AL031177 
X82200 


Homo sapiens 

Homo sapiens 
Homo sapiens 
Homo sapiens 
Homo sapiens 


Human secreted protein clone mc3O0_l protein 

sequence SEQ ID NO:124. 

Human icuKocyic ceii cionc rxruuow pivu^w. 

DOtORFUI 

dJ889M15.3 (novel protem) 
gpStafSO 


301 

2014 

148 

821 

249 


100 

"99 
50 
91 
62 


1074 
1075 
1076 
1077 
1078 

1079 
10X0 


G03213 
Y36233 
G03187 
L25899 
Y91447 

GO 1862 
AB039723 


Homo sapiens 
Homo sapiens 
Homo sapiens 
Homo sapiens 
Homo sapiens 

Homo sapiens 
Homo sapiens 


Human secreted protein, SEQ ID NO: 7294. 
Human secreted protein encoded by gene 10. 
Human secreted protein, SEQ ID NO; 7268. 
ribosomal protein LIO 

Human sccreieo proicin sciiuwiuc wiuuutu uj 

Ecne48SEQIDNO:168. 

Human secreted protein. SEQ ID NO: 5943. 

WNT receptor frizzle(i-3 


99 

506 

424 

332 

898 

290 
1376 


47 
55 
98 
76 
97 

89 
92 


ilil UU 


AB020527 

L13802 

W75098 

G03564 
G04063 
AF090942 

Gooin 


Homo sapiens 
Homo sapiens 
Homo sapiens 

Homo sapiens 
Homo sapiens 
Homo sapiens 
Homo sapiens 


Na/P04 cotransportcr homolog 
ribosmal protein small subunit 
Human secreted protein encoded by gene 42 
clone HSXBI25. 

Human secreted protein, SEQ ID NO: 7645. 
Mnmnn cpnvted nrotcbL SEQ ID NO: 8144. 
PRO0657 

" Human secreted protein, SEQ ID NO: 4598. 


269 
499 
143 

83 
88 
124 
129 


100 

80 

81 

51 
43 
64 
41 




G04091 

AF140631 

G04063 

S72304 

W88708 


Homo sapiens 
Homo sapiens 
Homo sapiens 
Mus sp. 
Homo sapiens 


■ Human secreted protein, SEQ ID NO: 8172. 

■ "Human secreted protein, SEQ ID NO: 8144. 
LMWG-protein 

Secreted protein encoded by gene 175 clone 
HEMAM41.. 


126 
364 
114 

146 . 
405 

4358 


36 
82 
32 
83 
100 

97 


1093 
1094 


W85612 
Y53012 


Homo sapiens 
Homo sapiens 


Secreted protein clone fhl23 5. 

Human secreted protein clone pm514_4 protein 

sequence SEQ ID NO:30. 


1013 


99 


1095 

1096 
1097 


Y92345 

AF090942 
L24521 


Homo sapiens 

Homo sapiens 
Homo sapiens 


" Human cancer associated antigen precursor from 
clone NY-REN-fi2. 
PRO0657 

transformation-related protein 


409 

147 
' 166 


100 

60 
" 58 


1098 
1099 
1100 


X56932 
G04063 
Y02693 


Homo sapiens 
Homo sapiens 
Homo sapiens 


23 kD highly basic protem 

" Human secreted protein, SEQ ID NU: 8144. 

Human secreted protein encoded by gene 44 
' cloncHTDAP22. . 


490 

83 

149 


70 
35 
' 59 



127 



wo 01/57188 



PCT/USOl/03800 



SEQ 
ID 

1101 


Accession 
No. 

AFl 19851 


Species 
Homo sapiens 


DescnptiOQ 
PR01722 

Human secreted protein, SEQ lU Nu: 5io/. 


Smith- 
Waterman 

Score 

183 

207 


% 

Identity 

72 
62 


1102 
1103 
1104 

1105 
1106 


G04086 
G04063 
X74856 

G03789 
G03133 


Homo sapiens 
Homo sapiens 
Mus 

musculus 

Homo sapiens 
Homo sapiens 


Human secreted protein. SEQ ID HO: 8144. 
ribosomal protein LZS 

Human secreted protein, SEQ ID NO: 7870. 
Human secreted protein, SEQ ID NO: 7214. 


91 
128 

130 
122 


52 
69 

62 
48 


1107 
1108 
1109 


G03040 

AF039942 

AF201951 


Homo sapiens 
Homo sapiens 
Homo sapiens 


Human secreted protein, btg lu rsu: /izi. 
HCF-binding transcription factor Zhangfei 
high affinity immunoglobulin epsilon receptor 
betasubunit 


744 
738 


43 
99 
94 


1110 

nil 


AFl 11 108 
AFn9900 


Mus 

musculus 
Homo sapiens 


transient receptor potential 2 

PR02822 

A protein that interacts with presenilins. 


144 

265 


59 
39 


1112 
1113 
1114 


Y16589 
G02872 
Y02999 


Homo sapiens 
Homo sapiens 
Homo sapiens 


"Human secreted protein, SEQ ID NO: 6953. 
Fragment of human secreted protein encoded by 
gene 12h ^ ^ 


178 
164 

fan 
1217 


67 
63 


1115 

ni6 

1117 


Y30811 
X51394 

M27826 


^omo si^iens 

Xenopus 

laevis 

Homo sapiens 


Human secreted protein encoaea irom genis i. 
APEG precursor protein 

neutral protease large subunit 

Human secreted protein, SEC ID NO: 7452. 


130 

442 

72 


40 

65 
60 


1118 
1119 
1120 

1121 
1122 
1123 


G03371 
G03602 
y35906 

G03714 
Y00337 
AF084830 


Homo sapiens 
Homo sapiens 
Homo sapiens 

Homo sapiens 
Homo sapiens 
Homo sapiens 


Human secreted protcm, SEC ID NO: 76S3. 
Extended human secreted protein sequence, skq 
ID NO. 155. 

Human secreted protein, SEQ ID NU: 7795. 
Human secreted protein encoded by fienc 81. 
two pore domain K+ channel; TASK.-Z 
mcvnKranfk tntprarttnc nrotein of RGSIo 


491 
244 

122 
110 
703 
442 


97 

Ql 

y/ 

65 

on 

94 
88 


1124 
1125 
1126 
1127 
1128 


AFl 12862 

W64469 

G01361 

G01361 

Y84320 


Homo sapiens 
Homo sapiens^ 
Homo sapiens 
Homo sapiens 
Homo sapiens 


Human secreted protein from clone CW79S_2. 
Human secreted protein, SEQ ID NO: 5442. 
Hiimnn «pr«»ted nrotein. SEO ID NO: 5442. 
Human cardiovascular system associated protein 
kinase- 1. 

Human secreted protein, SEQ ID NO: 61 bb. 


191 
154 
165 
815 

88 


53 
100 
100 
99 

73 


1129 
1130 


G021O5 
Y32923 


Homo sapiens 
Homo salens 


Tmncmpmhrnne domain containing piotein clone 
HP01512. 

Human synapse related glycoprotein 2. 


700 
260 


100 
91 


1131 
1132 


Y29817 
Y91644 


Homo sapiens 
Homo sapiens 


Human secieted protein sequence encoded by 
Ecne43 SEQIDNO:317. 


525 


96 


1133 
1134 


Y91449 

AB0n908 • 


Homo sapiens 
Homo sapiens 


Human secreted protein sequence encoded by 

gene49.SEQIDNO:170. 

4F2 light chain 


542 


100 
93 


1135 
1136 

1137 


X51760 
G03790 


Homo sapiens 

XlDUlO bapiClld 

Homo sapiens 


zinc finger protein (583 AA) 

Human PRO1604 (UNQ785) amino acid 

sequence SEQ ID NO:308. 

Human secreted protein, SEO ID NO: 7871. 


312 
917 

102 


55 
72 

50 


1138 
1139 


AF155106 
AL031035 


Homo sapiens 
Homo sapiens 


NY-REN-36 antigen 

dJ28H20.1 (novel protein similar to membrane 
transport proteins) 

regulator of G-protcin signaling 7 


768 
117 

138 


91 • 
50 

--96 


"1140 
1141 


AF011359 
Y70018 


Bos taurus 
Homo sapiens 


Human Protease and associated protcin-12 
(PPRG-12). 

■ Human secreted protein, SEQ ID NO: 8172. 


' 623 
113 


100 

38 


1142 
1143 


G04091 
AB030235 


Homo sapiens 

Canis 

familiaris 


D4 dopamine receptor 


89 


48 


1144 

1145 
1146 
1147 
1148 
1149 


Y94922 

X99962 
G03807 
G03712 
Y28279 
1 U13642 


Homo sapiens 

Homo sapiens 
Homo sapiens 
Homo sapiens 
Homo sapiens 
Caenothabditi 


Human secreted protein clone pv6_l protein 

sequence SEQ ID NO:50. 

rab-related GTP-bmdmg protein 

Human.secreted protein, SEO ID NO: 7888. 
" ■ Human secreted protein, SEQ ID NO: 7793. 
" Human G-protein coupled receptor uRR-l. 
" exon 5 similar to transmembrane domain of S. 


539 

398 
168 

512 
705 
1 247 


88 

"96 

79 
85 

' 76 
1 36 
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wo 01/57188 



PCTAJSOl/03800 



SEQ 

ID 

NO: 



Accession 
No. 



nso 



G03438 



TTsT 



G01003 



1152 



1153 
1154 



1155 



1156 



1157- 



1159 



1160 

nsr 



1162 



1163 



TTST 



1165 



G03798 



Species 



Description 



s elegans cerevisiae zinc resistance protejn 



Homo sapiens 



cergvisiac au^ itouw ivw — 

Human secreted protein, SEQ iL> NO: 7m 
— . . fi¥->r^ TIN 'KTrv» gnPit 



Homo sapiens 



ruiniau 5c<;re tc«J 

H..n,«„ ,.rr^^ nrotein. m) ll> NUTsm 



X88799 
D85245 



Homo sapiens 
Qryza sativa 
Homo sapiens 



fiuniall s cctcicu ytvi-wH "■"■^ 

Himan secreted protein, SEQ iU NO: 7879^ 



DNA binding protein 
TR3beta 



D8D:Z4D I nonw »8yi^i*a a*ww^.*. 

tt74^72 Homo sapiens T timoiir suppressor protem. p53 
Y862 65 Homo sapiens ' Hiimati sccreteo protcm HUSXl 



Human secreted protcm HUSXE77. SBQ ID 
NOrlSO. 



G02577 i Homo sapiens Human secreted protein, SEQ iDNO: 6658. 
AF104334 — [ HcMnn qflniens putative oigamc anion transports 
r — u^mati cpcTiEted protein. SEQ ID iN 



G01393 



W75771 
AF2 16833 



Homo sapiens 
Homo sapiens 



p utative Olgamc «nion ironatA/iy 

Human secreted protein, SEQ iDNO: 5474. 

r~T7:.— rzTO KinHfna nmteiil APD08. 



Human GTP binding protein APP08 
M-ABC2 protein 



W67816 



Homo sapiens 



Human secreted prot^ encoded by gene 10 
clone HCEMU42. 



AFl 19851 rHomo sapiens 



Y87252 



Homo sapiens 



PR01722 . , , 

Human signal peptide contaming piotem HSI'f- 
29SEQIDNO:29. 



Hnman liver cell done HI^Ul l48protein^ 



Homo sapiens 
Homo sapiens 



HC6 



Smith- 
Waterman 



117 



181. 



198 



95 
155 



341 



173 
224 
410 



1156 



230 



113 



338 
134 



% 

Identity 



62 



80 



63 



41 
96 



87 



57 



81 

S3 



100 



70 
31 



82 
64 



1167 



Y14482 



Fragment of human secreted protein encodeo oy 
gene 17^ 



1168 



D90789 



Escherichia | Dipcptidc transport system pemeasc protem 
coU _ I DPPO' 



1169 
1170 
1171 



R63783 
Y45274 
D64154 



Homo sapiens 
Homo sapiens 
Homo sapiens 



1 TG0847 protein. ..... 

I Human secreted protein enco ded from gene 1 8. 
I Mr 110,000 antigoi 



1172 



AB026256 



Ho mo sapiens I organic ani on transporter UA ir-o 



344 

347 
311 



1173 



G00357 



1174 



D87717 



Homo sapiens 



Human secreted prote'm. SEQ iU Nu: ^oc^ 



Homo sapiens 



"iimilar to human GTPase-acnvaimg 
protein(A49869) 



1175 
1176 



M64716 
R08330 



Homo sapiens 
Homo sapiens 



I ribosomal protein 
Human IL'7 receptor clone H6 



1177 



L06505 — I Homo sapiens ribosomal protem Li2 



1178 



AJ251885 



Homo sapiens organic cation transporter (QCT2) 



1179 



G032S8 



Homo sapiens 



Human secreted protein, SEQ IDNO: 7339. 



1180 



G01207 



1181 



AF181856 



Homo sapiens 

Rattus 

norvegicus 



Human secreted protein, SEQ ID NO: 5288. 
tRNA sclcnocystemc associated protein 



AF161524 
G03789 



Homo sa^iiens 
Homo sapiens 



1184 



Y02671 



Homo sapiens 



HSPCt76 ^ 

Human secreted protein, SEQ ID NO; 787 

"Human secreted piotem encoded by gene 22 



1183 



1186 



1187 

TisT 



1189 



1191 



1193 



1194 



1195 



1196 



1197 



1200 



clone HMSJW18. 



G03797 



Homo sapiens 



Human secreted protein, SEQ ID NO: 7878. 



G03564 
AB032905 

G00956 



G03258 



Homo saq)iens 
Hylobates 
concoloT 
Homo sapiens 
Homo sapiens 



"Human secreted protein, SEQ iD NO: 7645. 
dopamine receptor DT 



Human secreted PTOtein. SEQ ID NO: 503 /." 



G03361 
AFl 17755 

Y70455 



Homo sapiens 
Homo sspiens 

Homo sapiens 



H uman secreted protein. SHQ ID NU: /339. 
Human secreted protein, SEQ IDNO: 7442. 
thyroid homione receptor-associated protein 

complex component TRAP230 . 

Human membrane channel protcin-5 (MEuHP- 

5). 



G03052 



Homo sapiens 



Human secreted protein, SEO IDNO: 7133. 



G02607 



Homo sapiens 



Human secreted protein, SEQ ID NO: 6^»». 



W29661 



Homo sapiens | Homo sapiens Cb42 2 clone secreteo pT5tein. 



Y14104 



X61972 



Homo sapiens 
Homo sapiens 



Human GABAB receptor Id protein sequence. 



macropain subunit iota 



G00534 



Homo sapiens 
flomo sapiens 



Human secreted protein, SEQ iD NO: 4615. 

I protein HELriN47, SBQ ID 



NO:175. 



002607 



Homo sapiens | Human secreted proteia SEQ ID JNu: 6688^ 



178 



391 
285 



242 



276 



155 



282 
249 



138 
282 
107 



88 



118 



292 



178 



324 
187 

2or 



99 



192 



2001 



239 
TW 



145 



90 
98 
96 



67 
S2 
59 



78 
67 



72 



88 



71 



90 



90 
66 
71 



69 
46 



78 



[tT 

70 
"ST 



42 
76 



98 
69 
90 



51 



129 



wo 01/57188 



PCTAJSOl/03800 



SEQ 1 ^ 
ID 1 
NO: 
^201 


{Recession 
No. 

G00838 


Spedes 

Homo sapiens 


Description 

Huiian secreted orotein, SEQ lU NO: 4919. 


Waterman 

Score 

404 


% 

Identity 
50 


1202 
1203 


Y73424 


Homo sapiens 


neutral protease large subunit 

Human secreted protein clone yi4 J protem 

sequence SEQ IDNO:70. . ^ .- 


202 
265 


49 
61 


1204 

1205 
1206 


AF264014 

y36203 
U78111 


Homo sapiens 

Homo sapiens 
Callus gallus 


scavenger recqptor cysteine-rich type I protein 

M160 precursor 

Human secreted protein #75. 

AO 


625 

219 
205 
416 


98 

59 
57 
76 


1207 
3208 
1209 
1210 


AF095448 
AF116715 
AF099137 
AF205718 


Homo sapiens 
Homo sapiens 
Homo sapiens 
Homo sapiens 


putative G protein-coupled receptor 
PR02829 

'lepatocellular carcinoma-related putative tumor 
suppressor ^ 


127 
475 
423 


75 
95 
79 


1211 
1212 


Y27868 
G00719 


Homo sapiens 
Homo sapiens 


"Human secreted protein cncodea oy gene No. 

1 A7 

l\Jf. 

"Human secreted protein. SfciQ iU isu: ^cU). 
Tj.M«.An oi»/«rof«ri nmtmn SEO ID NO.' 5090. 


224 

117 

351 


70 

44 

73 


1213 
1214 
1215 

1216 
1217 


G01009 

AF090942 

Y14427 

G03905 
Y57897 


Homo sapiens 
Homo sapiens 
Homo sapiens 

Homo sapiens 
Homo sapiens 


PRO0657 

^uman secreted protein encoded ty gene 1 7 

clnneHSIEA14. 

Human secreted proiem, or.v »^^- 
Human transmembrane protem H l MFW-^x. 


124 

99 

173 
1173 


70 
77 

57 
100 


1218 
1219 1 

IJmJ 


J00194 
Y59709 

Wol ^ / u 


Homo sapiens 
Homo sapiens 
Homo s^^iens 


hla-dr antiRcn alpha chain 
Secreted protein 76-28-3-A1/-FL1. 
EBV-induced G-protein coupled recqjtor (EBl- 
2) polypeptide. ^ , 


454 
470 
725 


78 
92 
100 


1221 


W96745 


Homo sapiens 


High affinity immunoglobulm E receptor-like 
protein aOERB'^- 


650 


98 


1222 

1223 
1224 


Y359n 

Y00278 
AF161422 


Homo sapiens 

Homo sapiens 
Homo sapiens 


ID NO. 160. 

Human secreted protein encoded by gene 1 1 . 
HSPC304 . 


135 

260 
568 


31 

95 
90 


1225 
1226 
1227 


1 U14970 
1 G01733 
AF099973 


Homo sapiens 
Homo sapiens 
Mus 

musculus 


ribosomal protein S5 

Human secreteo proiem, or.v^ v>*\j. jo it. 
schlafeii2 

^iSan secreted protem. SfcQ ID NO: 5299. 


202 
610 
333 

155 


95 

100 

56 

81 


199R 

1229 
1230 


1 n01218 
AF217188 

AF176813 


Homo sapiens 
Mus 

musculus 
Homo sapiens 


YIPIB 

soluble adenylyl cyclase 


801 
275 


63 
100 


1232 


X98333 
W74955 


Homo sapiens 
Homo sapiens 


organic cation transporter 

Human secreted protein encoded by gene ! f 

clone HOEAS24. 


1704 
212 


100 
53 


1233 


Y94940 


Homo s^iens 


Human secreted protem clone yi62_l protem 

«*^niiftnr<- ^FO ID NO*86. 


526 
"482 ■ 


100 
"82 


1234 
1235 


U76618 
AF044924 


Mus 

musculus 
Homo sapiens 


N-RAP 

hook2 protein 
" "Human secreted protein. SEQ ID NU: 554U. 


380 
4J / 


97 
inn 


1236 
1237 
1238 


1 G01459 

AF000018 
I W88633 


Homo sapiens 
Homo sapiens 
Homo sapiens 


adapter protein 

Secreted protein encoded by gene 100 clone 
HF8EU04 


164 
250 

697 


■ 84 
90 

98 


1239 
1240 

1241 


W29660 
AF004I61 

Y92710 


Homo sapiens 
Oryctolagus 
cunicuJus 
Homo sapiens 


Homo sapiens CH27 1 clone secreted protem, 
peroxisomal Ca-dependent solute carrier 

' Human membrane-associated protem Zsig24. 


154 
709 


52 
97 


1242 
1243 


1 Y95002 
■ y44905 


Homo sapiens 
Homo sapiens 


" Human secreted protein vc34 1. SEQ ID NU:44. 
Human potassium channel molecule ERG-LP2 

partial protein. 

" cation-chloride cotransporter-interacting protein 


908 

323 

511 


88 
100 

97 


1244 
1245 

1246 
1247 


AF284422 
Y53629 

1 AB039371 
1 Y35911 


Homo sapiens 
Homo sapiens 

Homo sapiens 
Homo sapiens 


" A bone marrow secreted protein designated 

BMS115. 
" mitochondrial ABC tranq?orter 3 
" Extended human secreted protein sequence, SEQ 


1888 

389 
1 1^8 


93 

_97 



130 



wo 01/57188 



PCTAJSOl/03800 



SEQ 

JD 

NO: 


/Accession 
No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identity 


1248 


AF072509 


Rattus 
Dorvegicus 


ID NO. 160. 

glutsunate rec^tor interacting proiem ^ 
tandem pore domain potassium channel TRAAK 


559 

661 


90 

98 


1249 
1230 


AF247042 
B08974 


Homo sapiens 
ioroo sapiens 


Human secreted protein sequence encoded by 
Kene27SEQIDNO:131. 


1087 


97 


1251 


115313 


Caenorhabditi 
sclcgans 


putative 


358 


59 


1252 

1253 
1254 
1255 


y29338 

W01730 
G03074 
G01818 


Homo sapiens 

Homo sapiens 
Homo sapiens 


Human secreted protein clone it2l7_2 alternate 

reading frame protein. 

Human G-protein receptor HPRAJ70. 
"Human secreted protein. SEQ ID HO: 7155. 
"Miimfln oecreted nrotein, SEQ ID NO: 5899. 


278 

211 
294 
253 


75 

92 
83 
91 


1256 
1257 
1258 
1259 


AK286368 
AF220264 
002227 
Y07970 


Homo sapiens 
Homo sapiens 
Homo sapiens 
Homo sapiens 


eppin-1 
MOST-1 

"HiiTnnn QRcrptcd oroteiiL SEO ID NO: 6308. 
': -luman secreted protein fragment #2 encoded 
from gene 26. ^ , 


222 
87 
281 
81 


54 
93 
78 
94 


1260 
1261 


R95332 
AF140674 


Homo sapiens 
Homo s^iens 


'Fi<m#\f nAr«r/\cic fartnr i^centoi' 1 deflui domain 

ligand (clone 3TW). 

zinc mctalloprotease ADAMTS6 


986 
172 


100 

36 


1262 
1263 


U28369 
Y07049 


Homo sapiens 
Homo sapiens 


semaphorin V 

Renal cancer associated antigen precursor 
sequence. 

Human secreted protein 425. 


237 
288 

187 


67 ^ 
71 

80 


1264 
1265 


Y36153 
Y78114 


Homo sapiens 
Homo sapiens 


Human cytokine signal regulator CKSR-2 SEQ 
1DN0:2. 

Amino acid sequence ot protein PR0334. 


723 
191 


93 
100 


1267 


Y 13397 
AF030558 


Homo sapiens 

Rattus 

norvegicus 


phosphatidylinositol 5-phosphale 4-kinase 
gamma 


859 
159 


95 


1268 
1269 


U73167 
AF190664 


Homo sapiens 
Mus 

musculus 


canaioate cumor suppr»aof « 
LMBR2 


552 


76 


1270 
1271 


AL050332 
G02126 


Homo sapiens 
Homo sapiens 


dJ570F3.1 (homolog ot the rat synaptic ras 
GTPase-activating protein pl35 SynGAP) 
"Human secreted protein, SEQ 0) NO: 6207. 


820 
131 


98 
95 


1272 
1273 


AF125533 
AL03566I 


Homo sapiens 
Homo sapiens 


NADH-cytochromc b5 reductase isoform 
dJ568Cl 1.3 (novel AMF-binding enzyme 
cimiinr tn flcetvl-coen2yme A syntfaetfaase 
(acetate-coA ligase)) 


253 
1280 


92 
100 


1274 


AF064748 


Mus 

musculus 


S3-12 


"353 


61 

78 


1275 
1276 

1277 


D17554 
Y30715 

AF146760 


Homo sapiens 
Homo sapiens 

Homo sapiens 


TAXREB107 

Amino acid sequence of a human secreted 
protein. 

scptin 2-Iike cell division control protein 


643 
707 


90 
100 


1278 
1279 

1280 


Y05D69 
X59668 

G01051 


Homo sapiens 
Oiyctolagus 
cuniculus 
Homo sapiens 


Human PIGR-2 protein sequence. 

aorta CNG channel (rACNG) 

" Human secreted protein, SEQ ID NO: 5132. 
' Human secreted protein, SEQ ID NO: 74»2. 


267 

489 
120 


46 
85 

98 
43 


1281 
1282 
1283 


uUj41i 

AF055084 
■Atll*8l4 


Homo sapiens 
Mus 

muscuiusi 


" very large G-protein coupled receptor-l 
odd-skipped related 1 protein 


" 1635 
357 


100 
98 


1284 


U87318 


Xenopus 
laevis 


NaDC-2 


535 


60 

"68 


1285 


AF061346 


Mus 

musculus 


Edpl protein 


■■?52 


■"68 


1286 


AB030182 


Mus 

musculus 


contains transmembrane (TM) region 


582 




1287 


A13595 


synthetic 
construct 


tmimmosuppresive protem FF15 
ser/arg-rich pre-mRNA sphcing tactor SR-Al 


185 

837 


97 

100 


1288 
1289 


AF254411 
AP084205 


Homo stqjicns 

Rattus 

norvegicus 


serine/threonine protein kinase TAOl 


319 


98 



131 



wo 01/57188 



PCTAJSOl/03800 



SEQ 

ID 

NO: 



Accession 
No. 



AF038563 



Species 



Homo sapiens 



Description 



Watennan 

Score 



membrane associated guanylate li 



523 
468 



1291 



AF034837 



Homo sapiens 



double-stranded RNA specific adenosine 
deaminase 



1292 



1293 



Ml 5888 



Bos taurus 



endozcpine-related protem precursor 



AB010692 



Arabidopsis 
thallana 



ATP-dependent RNA helicase-UKe protem 



1294 



1295 



1296 



1297 



1298 



1301 



1302 



AF209923 



Homo sapiens 



orphan G-protein coupled 



W67828 



Homo sapiens 



Human secreted protein encoded by gene 22 



504 



clone HFEAF41. 



AC004832 



Homo sapiens 



X80035 



Oiyctolagus 
cuniculus 



similar to 45 kDa secretory protein ; similar to 
CAAI0644.1 (PID:g4164418) 



64r 



cysteine rich hair keratin associated protein 



575 



G02645 



Homo sapiens 



Human secreted protein, SEQ ID NO: 6726. 



1299 Y59440 



1300 W70504 



Homo sapiens 



Human delta3 fragment fi". 



223 
122 



Homo sapiens 



Leukocyte seven times membrane-penetrating 
type receptor protein JEOl 8. 



459 



Y67315 



Homo sapiens 



Human secreted protein BL»»_i3 amino acid 
sequence. 



3916 



M77693 



Homo sapiens 



speimidine/spermine is l-acetyltransfer 



174 



•Rnnifln secreted protein. SEQ IV NO: 5412. 



254 



1303 
1304 



1305 



1306 



1307 



1308 



1309 



1310 



1311 



1312 



1313 



G01491 



Homo sapiens 



Human secreted pr otein, SEQ ID NO: 5572. 



AF148509 



Homo sapiens 



G01658 



Homo sapiens 



Y90899 



Homo sapiens 



alpha l^»mannosidase 

Human secreted protein, SEQ ID NO: 5739. 



602 



333 



Dl-like dopamine receptor activity modifying 
protein SEQ ID N0:1. 



332 



AF033120 



Homo sapiens 



p53 regulated PA26-T2 nuclear protein 



348 



Y73388 



Homo sapiens 



HTRM clone 3376404 protem sequence. 



147 



AF063243 



ribosomal protein L30 



AF224494 



Mus 

musculus 



arsenite inducible RNA associaiea proiein 



296 
688 



Y73342 



Y99419 



Homo sapiens 



Homo sapiens 



HTRM clone 2709055 protein sequence. 
Human PRO1780 {UNQ842) amino acid 



1154 
1145 



sequence SEQ ID NO:282. 



1314 



1315 



1320 



AFl 16667 



Homo sapiens 



PR01777 



W75100 



Homo sapiens 



Human secreted protem encoded by gene 44 



807 




D90892 



Escherichia 
coli 



SORBITOL^PHQSPHAlb 
DEHYDROGENASE(EC 1.1.1.140) 
(GLUCITOL-6. PHOSPHATE 
DEHYDROGENASE) (KETOSEPHOSPHATE 
REDUCTASE). 



1321 



1322 
1323 
1324 



1325 



1327 



W67847 



Homo sapiens 



Human secreted protein cn< 
clone HPBCJ74. 



Looded by gene 41 



AJ276101 

AJ27610L 
y58628 



Homo sapiens 
Homo sapiens 
Homo sapiens 



U91561 



Rattus 
norvegicus 



GPRC5B protein 

GPRC5B protein 

Protein regulating gene expression PROE-21. 
pyridoxine 5*-phosphate oxidase 



466 
504 
1584 



1277 



AF125533 



Y32206 



Homo sapiens 



NADH-cytochrome b5 reductase isoforoi 



Homo salens 



Human receptor molecule 
lncyte clone 2S25826 



(REC) encoded by 



1606 
1531 




Y13583 
AF078866 



Homo sapiens 
Homo sapiens 



G-protein coupled receptor 
SURF-4 



2171 
195 
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SEQ 
ID 
WO: 
1334 


Accession 
No. 

Y25755 


Species 

Homo sapiens 


Description 

iiitmofi cprn»tpH nrntein encoded from fiCOe 45. 


Smith- 

Watemian 

Score 

1380 

4742 


% 

Identity 

96 
99 


1335 
1336 
1337 

1338 


AF152325 

X74070 

AF093927 

G03877 
AL008582 


Homo sapiens 
Homo sapiens 
Rattus 
norvegicus 
Homo sapiens 
Homo sapiens 


protocadherin gamma A5 
transcription factor B l t i 
protein phosphatase 2C 

Human secreted protein, SEQ ID NO: 7958. 
bK223H9.2 (ortholog of A. thaliana F23F1 .8) 
leukemia inhibitoiy factor receptor 


639 
1931 

621 
626 
5820 


81 
95 

100 
100 
99 


1340 
1341 
1342 
1343 


X61615 
Y01519 
AF207600 


Homo sapiens 
Homo sapiens 
Homo sapiens 

norvegicus 


A carcinogencsis-inhibiting protein. J 
ethanolamine kinase 
GTP-binding protein 


7528 
2372 
1167 


97 

100 

97 


1344 


AC020579 


Arabidopsis 
thaliana 


putative phosphoribosylfbrmylglycmamidinc 
synthase; 25509-29950 


3283 
~944 


51 
100 


1345 
1346 

1347 


Y28576 
W74787 

M55542 


Homo sapiens 
Homo sapiens 

Homo sapiens 


Secreted peptide clone pe503 1. 

Human secreted protein encoded by gene 58 

clone HHFHN61. 

guanylaie binding protein isoform 1 


1171 

2636 
1329 


100 

87 
100 


1348 
1349 
1350 


AF183428 

U70669 

■AF295530 


Homo sapiens 
Homo sapiens 
Homo sapiens 


28.4 kDa protein 
Fas-ligand associated factor 3 
cardiac voltage gated potassium channel 
modulatory subunit 


167 
562 


24 
99 



TABLES 



SEQID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted ' 

beginning 

nucleotide 

location 

correspond! 

ngtofiist 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino,acid sequence (A«Alanine COysteinc, 
I>^\spartic Acid, E=Glutamic Acid, 
p=Phenylalaninc, G=Glycine, H=Histicline, 
Msoleucine^ K=Lysine, L=Leucine, 
M=4v!cthionine, N=Aspaiagine, P=Proline, 
Q=Glutamine, R^Arginine, S=Serine, 
T=Thrconine, V«Valine, W=Tiyptophan, 
Y=Tyrosine, X=Unknown, *«Stop codon. 
/=possible nucleotide deletion. >F=possible 
nucleotide insertion 


1 


nil 


A 


2 


337 


1 • 


1?SUHQAPTPCPAGLWU/PPNGHYHGS»PCiC 
HWPQAPHRA***GLLPPRWLGHGLPGGPAAP 
WAASQWVDGVAGRLPGPAWSWHASGAAPA 
OPGPL*LLVPGSSQLPDPRDP 


2 


1352 


A 


27 


100 


"366 " 


IRNSSIRPNDCERETKLSAKHMITCSASYDlRCrL 
QIETnYHHTPIRMAKIQKT/GHHQC**ECGAT 
GTLIHGWWGCKWEPLGKTVWQIPK 


3 


1353 


A 


40 


3 


314 


. HASAHASVVLKDNSELEQQLGATGAYKAKA 
LELEAEVAEMRQMLQLEHPFVNGADKLRPD 
SMYVIdaLNEL*QSLVENMIXTVVDTHyRTPI*R 
SCNYTLALILFL 


4 


1354 


A 


74 


2 


292 


TASALFSCPDGGSLAGFAGRRASFHLECLKK 

QKDRGGDISQKTVLPLHLVHHQVAHTFQQAT 

VTCQQARQSPO^RTNPE/ALQWVLPVSDGWH 

VLPLP 


5 


1355 


A 


78 


114 


850 


ENCRVASNLPGVFFSEDTAQSGSYMRISAHl'i' 
NAGGEVSNGPKRKLTLMLNFSLPSSGLNAGA 
FYALSTLLNRMVIWHYPGEEVNAGRIGLTIVI 
AGMLGAVISGIWLDRSKTYKETTLWYIMDT 
GGAWWCYTFYLGTGDTCX5*CFITAG\TMGFF 
MTGYLPLGFEFAVEL\SYPESEGISSGLLNISA 
QVFGIIFTISQGQIIDNYGTKPGNIFLCVFLTLG 
AALTAFIKADLRRQKANKETLEN 


6 


1356 


A 


81 


97 


376 


' EWFSYMLGSNMSVYHSP*SLEH.CKVLSES'A 
YLRVPFIRlLLNAR*IRKAYKRMSLEIKLLimE 
♦CLFOEMGLSLQWLYSARGDFFRATSRL 


7 


1357 


A 


93 


2 


872 


TLSSACLIGDAWKELTIVAGAVSNQLLVWYJ' 
ATALADNKPVAPDRRISGHVGIIFSMSYLESK 
GLLATASEDRSVRIWKGGDLRVPOQRVQNIG 
HCFGHSARVWOVKLLENYLISAGEDCVCLV 
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SEQID 
NO: of 1 
nucl- 
eotide 
seq- 
uence 


SEQID 1 
NO: of 1 
peptide 
seq- 
uence 


Vtet 
^od 


3EQ 

[DNO: 

tn 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspond! 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to laSL aniuiu 
acid residue 
of peptide 
sequence 


Amino add sequence (A«Alanine C^Cysteinc. 
DMspartic Acid, EKrIutamic Acid, 
F^Phenylalanine, G*=Glycine, H<4^dine, 
I=lsoleucine, K«Lysine, LF^Leucinc, 
M^^Mediionine, N'^Aspaiagine^ F^Pioline, 
Q=Glutamine, R=Arginine, S=Scrinc, 
T=Thrconine, V=Valine, W=Tiyptophan, 
Y=Tyrosine, X=Unknown, ♦=Stop codon, 
A^ossible nucleotide deletion, V=possible 
nucleotide insertion 














WSHEGEILQAFRGHQGRGIRAIAAHERQAWV 

ITGGDDSGIRLWHLVGRGYRGLG/DLGSLLQ 

VP**ARYTQGCDSGWLLATAGSD*YRGPVSL 

♦RRGQVLGAAARG*TFPVLLPAGGSSWSRGL 

RIVCYGQWGRSCX}GCPHQHSNCCCGPDPVS 

WEGAOLELGPAWL 


8 


1358 


A 


106 


3 


350 


TSSLLSGRISTLRDETGAILIDGDPAACAPHKF 
LLTEELHLRGVSIYVLRHEAQIYGITPLWCAL 
LI/CRRL*SDSCMRAALNDRGLYQ\a-ILDGLV 
OCLGFVDSDSRKMVSTLT 


9 


1359 


A 


115 


49 


186 


QAWAJFKGKYKEGDTGGPAVWKTRLRCALN 
KSSEFNEGPERERMDV 


10 


1360 


A 


123 


2 


1249 


KGCRTQEKVDRTEVIRTCINPVYSKLFTVDFY 

FEEVQRLRFEVHDISSNHNGLKEADFLGOME 

CTLGQIVSQRKLoivoLL-ftjlvjW i aur^oi i v in 

EELSGNDDYVELAFNARKLDDKDFFSKSDPF 

LEffRMNDDATQQLVHRTEVVMNNLSPAWK 

SFKVSVNSLCSGDPDRRLKCrVWDWDSNGK 

HDFIGEFTSTFKEMRGAMEGKQVQWECINPK 

YKAKKKjmKSGTNaLNLCIOHKMHSFLDYI 

MGGCQIQFTVAIDFTASNGDPRNSCSLHYIHP 

•»r/-\T»xTT3VT VA1 "\/AvrtFipnnYnsnKMFPAFGF 

YQPNEYLlvAL V A V VJCIV^VI/ 1 uounxvirt xj* 

GARIPPEYTDSHDFAINFNEDNPECAGIQGW 
. EAYQSCF\PKAPTFTGPTNICPHSSRKVAKFRR 
SEGN»HQGRAFAHFILVDPGQVGVYSQDMGP 
DNPGGHFV 

■ A A T5i>r»T T noTVPfWrrvnOT T OGFLKGYSKT 


11 


1361 


A 


147 


614 


9 


ACARK.QLJ-rvjK 1 vr iwr y\j\^Liij\j\jEiusk\j a uiv* 

NTTSSRPASSRGVTLSSSSSSSSSLTKDALPSSL 

KSDSTTITSGLVFPFRSLCVNPAKSSVSESVSSI 

KILLSSSVKYLE*KRTSCCFPDSSESKLSQLSS 

DERVSMGTSSRKPTNSSSSLGALKMSATS\*G 

OPGPQPTPTTPT TfJI OSPPSTRPREPGLTTAKNS 

TTLTRDC 


12 


1362 


A 


177 


12 


416 


TffS^ALDSLVDPRVRSKKQPFVIYPVYDTAI 
DTKIHFSLLDGNVGEPDMSAGFCPNHKAAM 
\n cT WfiTPVODFLLHLLEGGFLPDLRAA 
ASLDT/AEIGAMDFLLS^LFTLCLMMFFFIYPFI 

NLLTMNVY 


13 


1363 


A 


249 


535 


105 


■ WTFHRHLSPAPLIVCDQGTCWSYYPQNIvg 
MPDTQMEQGLN/HLFLDGNA*PHSVECYCPS 
TFEIAIKTTSFVLYFHRYRAPEVLLRSSVYSSPI 
DVWAVGSIMAELYMLRPLFPGTSEVDEIFKIC 
OVLGTPKKVSTLVPKLL 


14 


1364 


A 


254 


572 


201 


" YLLTXIGNLMMLLVINADSCLRTXM'FfLUil 
FFFLDICYSSVTAQDAAEFPVS*KPILVWGYrr 
♦SFFFIFSWGTNGCLLSAITYACYAAICHPIXS 
TNmiNRPLCTATVNATl^MGFLNSQVN 


15 


1365 


A 


257 


425 


68 


■ THAKFU^KKFNIPKLVnJKLVYlVlUWllUMi 
AIEFLLECDQNITvKLICENT'ia^lAWSr&JUv 
TFTPIET*HPVKQMIKWQ»LTAWLRNRGYKKI 
KOTPNSETAPSVCRNLVFDKCG 


16 


1366 


A 


263 • 


104 


" 481 


■ ' FCEFRTTEEDRGGDDCWSVWTKQRNNSCVK 
SKDVFSKPVNIFWALEESVLGVKARQPKPFFA 
AGNTFENfTCKVSSKNIKSPRYSVLIMAEKPV 
GDLSSPNETKYUSLDQDSWKLENWTDASRV 


17 


1367 


A 


298 


68 


208 


- - Rmw^aSioaFEHFK^^ 

VSSLAMKEMLTKTTM 


18 


1368 


A 


300 


904 


1 


- LWGrrGTRHHARVlF'lFLVETGFPHVGQAOL 
ELLTSGDPPALASOSAGrrGMSHCARPKGHFO 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
ueace 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A^Alanine OCysteine, 
D»Aspartic Acid, EF-Glutamic Acid, 
F=Phenylalauine, Q=Glycine, H=Histidine, 
I=IsoIeucine, K=Lysine, I>Leucine, 
M=Metbioninc, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serme, 
i^inreoninc, v — vaunc} vv***ji/pLU|jiiai*, 
Y«Tyrosine, X«Unknown, *=Stop codon, 
/^possible nucleotide deletion, \=possibie 
nucleotide insertion 














IHLK*MFYTMSQKMP*PTI>iLILLLIIPGNLNIF 

TrTTXTii//^\irr r»Di^TAi7V7*lirrM3Vl QntPPAKftROR 
KPNMGWLOrlx 1 Ar V * ivUb V l^vjlr r /\i\.vjiv\-.fs. 

WK»DY*C/LQEVTDPIMEKGKKKKRTASFFK 

C3QPHQSTNALLRRCVR*RYHLS\TVETAGLP* 

KNTGHIPGQPFLFKLVFKC*NVICI**QYKW»Q . 

NIGVKNKSFCPH'SSSPSL^FIGHHSRNF/CSFK 

TEPHSWQAGGQWRNLSSLQAPPPGLMPLSR 

ISLMSSWDYRRPPQ 


19 


1369 


A 


302 


3 


445 


NSPSRWAKIQMFEHTFCG*GCG/ER/NVHIHCS 

WICRLRPLLWRAVREYLSKLKNAELSFDPGV 

SLLRIYAIDMPTSI»DEKEAIXFAFLAJFHE*HC 

KSRIWAV1Q/CIHLWDWLRKL*CFHRMKFYA 

AV*NKPRHLLSHIWKDVQNE.LK 


20 


1370 


A 


304 


1 


1339 


■ FFFCGKEVPLFEQNKHPGPRATTSPGAmAKA 
LLSAGEFTAGVGLSP^AIHSFVWLCmQHGA 
GGPCHQPGGSPGPWMHITQAGHLWEGAYPG 
GSSTWHQVPGQLGGSWGPRERSLLGSFIKCSP 
CPHPPGFRLWMSPNQKPPTENPGVMGRVWR 
LMPGESPLIWEAEGKEDHLSPEGQGHSEiT»VA 
PLHSSLGNTVKP*PKNQKPKQNRSRHGQ\GF 
MAGQGQSRPAAR^-PPCFALTPASHSAGTWPP 
R1CRTVPGGPCPSPSGFRSCRR*GFSA*TRSWP 
DAEPPSTPDTAPRCCTQSDTSSQGPQ*S*WRR 
CRALPGRLCSAPAAGLRBARPRLSESRRGNSP 
PASPAAASARCPSWGPSCPARPPSRPAAGTEP 
AAPSRCTAWLRGEREPGPRPPGRRPRSGRGP 
VSFAPEVLSLPAVRQTKSWRWRNEEEITRPW 
ALVRSRGG 


21 


1371 


A 


326 


799 


1587 


"GSQVLPPP,PSQDSATLPQDA*GPRAAPGgi'Vt; 
E*GLQGAGVRRLRGEVLCQPQP*GAL»EQCLP 
HLSFSPRQGAAPDTEPSAWGPAPTGATGPGLP 
LRHVRLFSAGAPRGAATPCPPALLHuPAWi'J' 
ARPMFRGHPPVRPLGPWGKVAAGPRALCLA 
GVPAVQGECATKPSG*GL*PAHLRGPPGPEVL 
QWHWQLSAGRDPVPAEDPPL»EGPLGPGGPA 
AAQAEPGADPEPEDKDQAAESRPAGAMSLSA 
OGSGPVGGQGLR 


22 


1372 


A 


327 


146 


652 


■PHLENPHPEHSFPGAPLT*STLSWSILSPREP5P 
r> A •or»vnr»xioiJT mJDiTT PHI 1 TWRTVTW55TL.L. 
PGAPCYPEHPHLEHPLTWSTPHLEHPSPGEPL 
SCRTPTRSILHRDHPLP*CLSTEESPI*GWGSLP 
ADDOTPT VI nvAPPOPOPASSCPGRDSCYSVP 
GTWSP 


23 


1373 


A 


348 


397 


2 


" CIVSSCQGTRKPCHLEDANKINKQSPTLEKIES 
T r^cCT * wn*i TVAFICWOILHPRKKYFORPL 
NNEKRKMKKiaCEEKKKCRERMQRRSKWRR 
EEKKE*RREE\EERKKEKEDRKERRKETSPRG 
SRRLLRD 


24 


1374 


A 


362 


170 


352 


" GRALiyrAAGSPVQTAHGLPSDALAPLDDSMF 
WEGRTTAOWSLHRKRHLARTLLVSRVRGPQ 


25 


1375 


A 


384 


373 


128 


■ YLlTTE.ETGYLWKNRHSDQ*KR'rHNPERUQM 
KYPKVDFCKSNSMKNRLCNKWHWTNWIFTD 
• KKINLNLKPHTKLTPNIKKN 


26 


1376 


A 


397 


383 


" 165 


■ ■ EVKNTNPFIFSGTNLTlWIRSI»RKSDEiNgK J JS. 
♦MEKYSISUDRRlJin^KMSFLPm-rYKFNTISl 

KIPANF 


27 


1377 


A 


406 


103 


380 


KSKATGYMVNI*KUV\FLYANDEQLEIEiyiNK. 
IWNFNGSKNKIAFmTKYQNIQNRHAEhTYKI 
LYNKIEDU^WRNVLLSWIGRRNIINWr 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

conespondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine OCystein^ 
I>Aspartic Acid, EKjlutamic Acid, 
F=Phcnylalanine, G=<jlycine,H«*Histidine, 
TsTcnleuctne K?=Lvsine. LpLeudne; 
M=Metiiionine, N=Asparagine, F=Pn)linc» 
Q=Glutamine, R=Arginine, S»=Serine, 
T^Threonine. V«Valine. W=Tryptophan, 
Y«Tyrosine, X=XJnknown, *«Stop codon, 
/^possible nucleotide deletion, V=possible 
niirlentide insertion 


28 


1378 


A 


408 


14 


427 


TICnmiWlLDEIK/FLERHKLSKLTQEEVIiNL 
ITLKTSRETELVINK*VIPHKEKPGPDSFTGEF 
YQTFKEEL/MLHKLFQTIKYGRILPNSVYETSI 
TLKPKPEKDLVKJENYRI^PLSNIDAKXLNKTIA 


29 


1379 


A 


434 


395 


128 


lYSKMCMERQRLNN^lLKKNKVROlAVruvK 
VYYKPTVIK/rSWIL*KDSHIVEWNRLENLEID 
PN/IKRLILDKGAEATEWRKDSFFRQWQ 
-Froirrp oiJc\rrn A ovriWPKHPGFKRFSCFGLSS 


30 


1380 


A 


455 


2 


228 


SWDYRyAPPRP\ANF\*FLVETGFYyVAQAGL 
KLLSPGDLPALAS 


31 


1381 


A 


462 


393 


2 


QUvIFDKGVK>mWGW'lWFm*YWKNWlM 
CRIU^NPYLSRYIKINSR\KDLTVRPEP1KLV 
EENTGKTIQDTOLGK^FIAKTSKAQSTKTNK* 
ICRQTRYIKlJC\iCKSTASKENNRVKRQPLE*EK 

IFAN 


32 


1382 


A 


474 


125 


471 


" VKPYEIAVFLVKPIEyK*HLLSDPAIPLSUJ'l.K 
EIKAYT/RRIOTMFAAPVSVWRN»KQSKyCQ 
KQ*YVHRMEYYTnKRSEILICTTrWVDFRNT 
ILRETDRIHKTTYDVISLI 


33 


1383 


A 


488 


1825 


2 


kSACSFlCSEEQPASPSPLKPGTYASETWKUi' 

HAAGPRRDSSEAETRRPROA/DGSGTWKGT 

PGSPAPPCSWGHGG\ETEGAG»CPAAPGTDLR 

APGGSAGSnGLPSAGGSRGRKGWRAAGRQP 

STR*ORPGRHGGROE*AOHPEPRQSALQSAG 

L/ASSPEPMGAALAEDGSGDSRGAGPRPQE*P 

PSVLSRS\GS*G*G*AASGTASSPRSHSSRLGPF 

SAGFHGLRCGQPPFAAAPPGPWPGTGRPAOG 

AGSPPAAAGTAJrr A 1 )XSJJ\\ioKJSS^y*VLLi\\jru,^r\ 

SPQTAAGAGSPVQWALSRATO*TGETGSWC 

AGGTHQATHLTAAWVCPPrWSVRPGGSGPA 

AGLGR*GRHPAQSPPLPVPRG*PAWPQEAPSP 

SPASSEVALSSGSCWPDQAPGPARGSPPAPLA 

nAiTfnA Ar'Dmjmi^riBn^JAWPPPRR^STAVSL 
PAWPAAOKOKl^JK. vjivV^oftii* * w » **** 

SGTS*WRRSP*AGTRTQQC»SPWLVPACSSRP 
L*RGTRRPSTQQSPQTrGTPGRSAGPGHPRS* 
GGRSPAGTGHLGAQTVASPH*OHWPTALSCL 
Ai/ A Q A QPPrfrPF A PPOTGACIGTNCRYRAASAR 
RSSVAPACA*GWQ*AGSPPAVLRGPP*RVRER 

GALTHRPRAPDE 


"34 


1384 


A 


497 


422 


2 


■ APGASVGRAQAAEG'^RGGPlGRPPSALGVb/ii 
AGRAGRAGEGRPVPPAYPLCKSAQfTSGPPKA 
RLSVPPLASCGGRGPPGGAACATCAPPAGPAR 
SSRCRRRSPPE*GPR*PSRPARPSPGSAASRRQ 
KT.TPCRCOFRGLCA 


35 


" 1385 


A 


509 


15^ 


475 


■ ptPYPGE^QAAFLLRGPGLRPPA/DPSLK/HKisi 
LTELWAVTDENIVGLFAALLAERRVLLTAS 
KLSH.TSCDHAFCALLYPMRWEHVLIPTLPPH 
LLDYC*CPPLPRT 


36 


" 1386 


A 


.512 


3 


1631 


" Ttfsfvchlycvsptpgphgklatwupulla 

FLGLAAGGQTLCPAGELPGHARAQASGAPGS 

vliavpgrrrvhtcgpgpaapstrgecpppal 

ghtrparprpv\pfapavpqepgoqqhgaa/p 

patghsaprgcpparaaptgsatpapppaaca 

afhsawsvppagrqqg»rvpapafrrttpgt 

pgohlldrpgappaqgsgpapappprlagpa 

GPAAPPPGPPAASWHSSLSKSSSSLVGWSPPLP 

vgpgslq*tpppqgphlsgscggtsswrgqr 
aavakrlrswnacolsrvagrssasypgre 
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SEQID : 
NO: of 1 
nucl- 
eotide 
seq- 
uence 


SEQID I 
HO: of I 
peptide 
seq- 
uence 


Viet ! 

lod ] 


5EQ ) 
ID NO: 

in 

USSN 
09/496 
914 


Predicted 1 
)eginntng i 
lucleotide 
location 
correspondi 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
oca:tion 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine OCystcine, 
[^"Aspartic Acid, EKJlutamic Acid, 
F=Phenylalanine, G=Glycine, fr=Histidinei 
Msoleucine, K=Lysine, L=Leucinc . 
M=Methionine, N=Asparagine, PMPxcline, 
Q=Glutamine, R=Arginine, S=Serine» 
TssThrconinc V=Valine, W^Tryptophan, 
Y«=Tyrosine, X^Unknown, *«Stop codon, 
/^possible nucleotide deletion, )r=possible 
nucleotide insertion 














GRPSQSQ*PAGPPGMRGCCLRGW»PSSSUSD 

GPGPHPASTWLRAGKTGPSPPACX3CA*LPPPS 

VSAAPQSPRTRCPRGCAAAAGLCVLAAAOAS 

HGA\GLPGVRVHrQRVHIH*GAG/GCQTPRPR 

LRSLPVLGLPAPRCPVSAHPWHRRSGSSCHA 

ADi vppwpAPf;rP**TG*VPLn'GFPEP*A*GLP 

NHQAVGLEASGALQAOHRDELPIMVQLLDH 

SPDYPLKGRPHAP 


37 


1387 


A 


620 


828 


1 


FRLPLAAGA/ROAAEFRVAVSMAfUrSAJUlt 

WEASPEMQSKCHQKGKNNQTECFNHVRFLQ 

T»T vTeTin V AnrtTUAPnPT/^AArDAEAFnJTS 
RLNSTHLYAu01il/u^V^» i-*-*'*'^^ *w 

FEEGKEKCPYDPARGFTGUIDGGLYTATRYE 
FRSIPDIRRSRHPHSLRTEETPMHWLNG*EDE 
AQDDGG*GnSSFLLPWPADHPTPKSPGEPVH 
SIPVCCQVRGQPQSGGKESPACLKSLSNCLTH 
\DAEFVFSVLVRESKASAVGDDDKVYYFFTE 
RATEKESGSFTQSRSSHRVARGIPPL 


38 


1388 


A 


739 




427 


'FRAMVSSTLKLGlSlLNGGNAEVg/i^UNK^fwu 
TSEEGKEG*EVPV*LPVSPPLPRPLQKMLDYL 
KDKKEVGFFQSIQALMQfraOEKVMADDEFT 
QDLFRFLQLLCEGHNNDFQhmATQTGNTTT 

INfflCTVDYlXRLQESl 
— _ — rn^nt r r /■» r» ^ n>\ rs f py nPtf f\ P MR OFfify^M 


39 


1389 


A 


767 


I 


1030 


TLDLTGPLLLGO V rN V rJsXFriwi«NRvrvj^j\i*« 
IO^JLSVIX}KNVDMAGFIANNGTREGCAARRN 
FCDGRRRQNGGTCVNRWNMYLCEa>LRFGG 
KNCEQGEWPASSIPPVTAAWEALLLDVPGTr 

VRGLmQVRQPLYV I AATI VL/orixu^i-'V*>-i 

RRAPAPASGVPSPSGVGWDR*AGPAEFSPSTP 

AT\/TT«3\/p\UYi fit MFRTRNKEDSVLMEATSGG 

PTSFRLQVTGAPCHQGTC*VGARGRDPMLSG 

LRVTDGEWHHLLIEUCNVKEDSEMKHLVTM 

TLDYGMDQVSWHLHLLWG*TLPPAQGKTGA 

SEDKVSVRRGFROCMQVRGGCGORGEACPS 

CSlk. APRT 


40 


1390 


A 


801 


19 


"399 


ItflaDHKEDLhOCWkVlLCtiGMiiRl^VMlFVV 
PQlIYKFNA*Q\VILKFTW*E*OAKmLRKNKL 
RGl VLVPLSTC*VKYLLDKVLPHIKTYYEAR 
VNKSWLVOVnM 


41 


139) 


A 


835 


7 


"155 


■ - SMLKERKVFQFPSCLFFQYlTWLGPPYHVLi'U 
<;SYT^^«sinAK*DlI.OSVMNCLYAKRIPCVT 


42 


1392 


A 


841 


1 


415 


' "GSTHASGYDKTPDFILQVPVAVEGHIIHWli^S) 
KASFGDECSHHAYLHDQFWSYWNSLKHRTW 
QGIGTVAmSQL*TLNAPFPELLLFRSLARTG 
FVLT*\RFGPGLVIYWYGFIQELDCNRERGILL 


43 


1393 


A 


845 


358 


92 


- -pALSPAPVPQKKQSPLPLDPCLGPSSWLLSVG 
LGWPRL*PRRGPGDPGSLPATPPLLTPPHTLLP 
QRPMLPPSHAGLARPPPPEPISVP 


44 


1394 


A 


853 


452 


1 


■ LPQYCFFPRLSPKSKLVKHbAL' bALJSjr i a. 
SPRCIPRTSLYFTICC/PPALQUSPIEDPPAIYRS 
PPTHMLRSASQPLNQAPTLVKGHPPSRFLQG 
QVSCPPQPTLPREKPLPLHLRPPPRPAQPPLPR 
PLTFSTRRNVDPEIPERFR 


4S 


1395 


A 


894 


379 


162 


" GVYPPTVFDNYSVQTSVDGQIYSLNlWUiAij 
QEEYD/RLRTLS'PQTSIFVICFSIGNLEFPIYGrr 

WLSMSMGK 


46 


1396 


A 


900 


1 


"l66 


" "rrKKTLISNNVSSRSU>lLPl£LKAFSlJ\tNL>l'l- 
EIQKYMRT/DQ*CVTHDISLYIVTKLALIFLIPR 
VFIJHQLNrr**CXHFFTMnHAIPFSFLFLGR 
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SEQID & 
NO: of > 
nucl- p 
eotide s 
seq- V 
uence 


EQID ^ 
40: 01 H 

eptide 

eq- 
lence 


^et S 

ii 
I 
C 
< 


EQ I 

n r 
JSSN 1 
>9/496 < 
U4 1 
1 


Predicted I 

lucleotide 1 
ocation i 
»rrespondi 
ig to fust 
unino acid 
residue of 
Kptlde 
sequence 


Predicted end i 
lucleotide 
ocation 
:orresponding 
^ last amino 
acid residue 
of peptide 
sequence 


\iDino acid sequence QA-Alanine OCysteinc, 
>=Aspartic Acid, &=Glutainic Acid, 
'^Phenylalanine, 0=Glycine, H=Histidine, 
:=Isoleucine. K^Lysine, Lp=Leucine, 
M=Methiomnc, N^Asparagine, P^Proline, 
Q=GIutamine, R=Aiginine, S^Serine, 
T=Threonine, V=Valine, W=Tiyptophan, 

VBsTvrnqine XMJnknOWn. *^tOD OOdOA, 

/=possiblc nucleotide deletion, V-possible 
nucleotide insertion 
D/KSLAMLPRLVSNSWFQVILPP 


47 


1397 


A 


944 


162 


2 

308 


QLQNLASRGCL*SQLLRRLKiOiNRLNPCKKiO 

CSEIArVC 1 r A W V 1 ^^i\ur r iuvinjw 
HFTPDRIAIVKNTRDSHCWRGC*EEG^ 


48 
49 


1398 
1399 


A 
A 


963 
967 


216 
466 


1 


PRKBESWWGERLP/PRGFPPAAEDAPAPUWK 
GRKHASRTARAHVFHPIRQSIRSPVRGRPGDP 
IUAHmSAGTRLQCKASRGG*GKGPAPTR*E 
GGPGSAPAPLPASSGCSLFPDSSPWTTPPPAPG 
AAAAGP**TPRCPAALRAGAHIGRVGRPY 


50 


1400 


A 


973 


45 


421 


EKCIQALDVFVFCYIDHSSHCLMSCD'EAJQA 
LNFMPLEMEPKMSKLAFGCQRSSTSDDDSGC 
ALEEYAWVPPGLRPBQIQLYFACLPEEKVPY 
VNSPGEKHRnCQLLYQUPPHDNEVRYCXISLSE 

E 


51 


1401 


A 


992 


2095 


155 


IRIRHEAARSCLGCAAGHVPAPGLRLLPTVKU 

PPGRRGPAAPGCVCY*SGESTFVSHVPQRMA 

WPGSAPPRGFHPLQSQTSPSDTVSSPQLSKEE 

DGPGWEHPLSSSL*SLGQAGGNH*QPEELAG 

WEPRGPPSLAPSSPT/TMWTALVLIWIFSLSLS 

ESHAASNDPRNFVPNKMWKGLVKRNASVET 

VDNKTSEDVTMAAASPVTLTKGTSAAHLNS 

MEVTTEDTSRTDVSEPATSGVAADGVTSIAPT 

AVASSTTAASnTAASSNfTVASSAFITAASST 

WASIAPTTAASSMTAASSTPMTLAIJPAPTST 

STGRTPSTTATGHPSLSTALAQVPKSSALPRT 

ATLATLATRAQTVATTANTSSPMSTOPSPSKH 

MPSDTAASPVPPMRPQAQOPISQVSVDQPW 

KirNKSTPMPSNTTPEPAFTPTVVTITKA^^ 

EPTASPVPVPHTSPIPEmEAMSFI ivrM'Mri i 

ORAAGPGTSQAPEQVETEATPGTDSTGPTPRS 

SGGTKMPATDSCQPSTQGQYMV/DHH*APHP 

GRGRQNSPSGOAVTRGDPFHHSLGFVCPAGL 

♦ELQEEGLHPGGLLNQRDVCGLRNVRGAGA 

WREAWPLPRPFLLPLRPNQVLPNSFGAIEEIC 

pMLKHI 


52 


1402 


A 


994 


1 


462 


ESGEFLVSFTLKKl'lNYftuuisuiyuwrriixwi^i* 
*smDIAFYKIQHPFMLKALTKWA*EGT*PDR 
RYLH*SLRLNGEQLKTFPLRSGMR*G/CAILPL 
VI TMAMLSrVPAWPAGKTRHEKEITCPLIGQE 
Plf*FS*FVGDMNTCVENKKESICKLLE 


S3 


1403 


A 


" 1011 


1 


630 


■ PEVIQQSAYDSKADIWSUilTAIELAKGEPPWS 
DMlffMRVLFLIPKNNPPTHCWRRLLESFKEV 
AiviiT A*wnPQi\wPTAKELLKHKFIVKNSKKT 
SYLTELIDRFKRWKAEGHSDDESDSEGSDSES 
TSRENNTHPEWSFTTVRKKPDPiCKVQNGAEQ 
DLVQTLSCLSMIITPAFAEIXQQDENNASRNQ 
ATEELEKSIAVAEAAGPG 


54 




A 


1016 


1 


222 


•■ lsiDA*KAFDKIQH/CFMmLKKLGIDGKYLN 
TIKAIDDRHTVSTILNvEKLKArL'KS>u i kv^kj 

PISGSGARI . ■ 

" "HASVDGDEGSDDVYYYYTPAILRELQALN lA 


"15 


1405 


A 


""Tosr" 


"1 ~ 


"l66 


EAAEHRPEEDRMLSEDPWRPAHMIKGYMPL 
HNIPHTEVIDVTGLNQSHLYQHLNKGTPMKT 
nifWAANLYTWHVLEOLEILRQINQQSHGPG 


56 


1406 




1044 


" 5 


"429 


" SVLTlQTRSPSKPLSUlKUyiUWEVVSRNSlSli 
DRLETQSRASRSPPVTPNQSQEIPVDGKPUU. 
PPNQSQKNmYHIHYLHLQYYLDRHISATLPff 
SSSGimiAVTTDALTDLVELILGQPCSEESGR 

APGTLFLLAL . 
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SEQIU 1 ^ 
NO: of 1 
nucl- I 
eotide i 
scq- 1 
uence 


JEQID I 
sIO: of I 
)eptide 
!eq- 
lence 


4et S 
lod I 
i 


5EQ 1 I 
DNO: I 
n I 
JSSN 
)9/496 ( 
?14 1 


>redicted ] 

jeginning 

lucleotide 

ocation 

:orrcspondi 

net fn fii"ct 

amino acid 
residue of 
peptide 
sequence 


Predicted end 

nucleotide 

location 

rn rrftsrmndtnfi 

to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence ( A=Aianine OCysteine, 
D=Aspartic Acid, EF<jlutamic Acid, 
F=Phenylalanines 0=Glycine, H=Histidine, 
hlsolcucinc, K«Lysinc» L-Leucine, 
M=Methioninei N=Asparagine, P=Prolinc, 
Q=Glutamine, R=Argiidne, S=Sainc, 
T=Threonine. V«Valine, W«Tiyptophan, 
Ys^yrosine, X=Untaiown, *=Stop codon, 
/^possible nucleotide deletion, V=possible 
nucleotide insertion 


57 


1407 


A 


IUjU j 




430 


GAYAFETNGFPIMLVLTTDKIEGDVGIAGLYD 

MHUSLPMAI^UITLVRCTSYIIPVTHVLSTPV 

TCLRRREKDGVIVDVLSDrASNHNGFPVEEH 

ADDTHPARLQGFTLRSQPMGPLKHKAFEERA 

N!.GLVORRLRLED 


58 


1408 


A 


1058 


258 


419 


LKHRDTPWGANNRALSCTPLTSLTLCALU'L 
PCLGCFTXATCRLYQTTVAWF 


59 


1409 


A 


1064 


3 


425 


"kAFSFTTSLlGHQRMHTGERPYKCKECXjKTi' 
KGSSSLNNHQRIHTGEKPYKCNECGRAFSQC 
SSLIQHHRIHTGEKPYECTQCGKAFTSISRLSR 
HHRIHTOEKPFHCNECOKVFSYHSALIIHQRIH 

TfiEKPYACKDVGK ^ 

"GGPPGPFLAHTHAGLQAFijl*LLAPAGDEUUL 
LLLAVQQSCLADHLLTASWGGK/DPIFIKALG 
EGOEGLPLTV 


hgo 1 


1410 


A 


T065 1 


"204 " 


"?l5 ~ 


61 


1411 


A 


1079 


3 


383 


-MSRmCQPFHLVMRDLLQLGQDIPQGCHY 
LEENHLIHRDIAARNCLLSCAAPTRAATIGDF 
GMAKYIYRTRYYQLGDRAIAPRKWMPPEAL 
T prnmTsrmsWTFGVLLWEIFSLGYMPYPGR 

TN 


62 


1412 


A 


1080 


1 


859 


WEFLWSRRPSGSSUPRPRRPASKCgMMCiiK 

ANUfflMMKLSIKVLLQSALSLGRSLDADHA 

pmOFFVVMEHCUfflOLKVKKSFIGQNKSFF 

GPLELVEKLCPEASDIATSVRNLPELKTAVGR 

GRAWLYLALMQKKLADYLKVLIDNKHLLSE 

FYEPEALMMEEEGM\aVGaVGLNVLDANL\ 

aKGBDLDSQVGVIDFSLYLKDVQDLDGGKE 

HERITDVLDQKNYVEELNRHLSCTVGDLQTK 

IDGLEKTNSKLQERVSAATDRICSLQEEQQQL 

REONELIR 


63 


1413 


A 


1083 


2 


615 


SSFAKHKRIHrcEKPFICLECGKAFTSSTTL 1 K 

HRRIHTGEKPyTCBECGKAFRQSAE.YVHRRI 

HTGEKPYTCGECGKTFRQSANLYAHKKIHTG 

EKFYTCGIXXiKmQSANLYAHKKIHTGXEKP 

YKOCECGKAFKSYYSILKHKRTHTRGMSYEG 

DEaQRSLN/RSSILSNHKUHNEEK/PLKCEKCE 

KAFNHTSICCRHKKN 

'KKQDLSSSLTDDSKNAQAPLALTESHLATLA 
SSSQSPEAKQLLDSGLPSLLVRSLASFCFSHIS 
SSESUQSIDISQDKUUUDIWQQCNKMPrrAD 
LVAPILRFLTEVGNSHIMKDWLGGSEVNPLW 
TALLFLLCHSGSTSGSUINLGUQQDQCKISFS 
FFSWLTTGLTTQQRTAIE\NATVAFF\LQCI\SC 
HFNNQKLMAQVLCELFQTSPQRGNLFTSGNI 
S\GFm\RLFWU«LEDEKVTMFLQSPCPLYKG 
RiNATSHVlQHPMyGAGHKFRTLHLPVSTTL 
snVLDRVSDTPSTTAKLlSKQKDDKKKK 


[64 


1414 


■~A 


1084 


T946 


"1 


165 


1415 


A 


1087 


103 


324 


' PRAFH^TENirVG/RVQNIHLFTLQVLKUKA 
1 irTXircvr5Q<5i W^TYT THVMALP/DRELLKPNA 
SVALHKLSNALV 


66 


1416 


A 


1095 


3 


493 


- -HETCSVTHIVSFSLPFLNPSHPASTPGHl bNHQ 
PSLVWFDRGKFYLTFEGSSRGPSPLTMGAQD 
TLPVAAAFIETVNAYFKGADPSKCIVKITGE 
Nm.SFPAGlTRHFANNPSPAALTFRVINFSRlJB 
HVLPhn>QLU:CDNTQl^Al^^FWVNMPNL 

mthlk^ 


67 


1417 


A 


1098 


57 


356 


" LKLTSLGFliGVSWGI^LLlSlLLVKDKTLiiKA 
PYYFLLDLCXSDILRSAICFPFVFNSVKNGST 
WTYGTLTCKVIAFLGVLSCFHrAFMLFaSVT 
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SEQID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

coirespondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A«AJanine OCystcinc, 
l>Aspaitic Acid, EpQlutamic Acid, 
F=Ph«iylalaiune» G=Olycine» H=Histidme, 
Msoleucinc, K^Lysine, LpLeacin^ 
M=Methionine, N=Aspaiaginc, Ps=Piolme, 
Q=Glutamine, R=Arginine, S^Serinc, 
T-Threonine, V-Vallnc. W-Tiyptophan, 
Y-Tyrosinc, X=Unknown, ♦-Stop codon. 
possible nucleotide deletion, V^ossible 
nucleotide insertion 
RYL 


68 


1418 


A 


1106 


1 
• 


1326 


MGKISATGINMGTKCSWALVWHLESYDPKH 
YEREGMQDWKTASGQSEEATQQSSQKPQPH 
YTTYQSSSFLKYSSESHLLAWRENSSEGSFQF 

HPQSAAQPQLRAAARIPESPAAFPAQPRPGSA 

RNSDASGPASLSRTLGRASSPRPPQAPDVTAP 

SPAALAPRAARGGSRAAALAGAEAEEPLRTL 

APRPTRAAAPPPPPPPPPLPPGAPPPPVRCVSR 

IlARAPPWR/r AA 1 url^i^Kr V Ar^wsJAjoMJW^ 

APALQIRKGTSSGLPGRGGGSGPGNNLSSVA 

GNWRGSSFAVERPGMAKYQGEVQSLKLDDD 

SVIEGVSDQVLVAVWSFALIATLVYALFRNV 

HQNIHPENQELVRVLREQLQTEQDAPAATRQ 

QFYTDMYCPlCLHQASFPVEmCGHLFCGSLT 

PNSIW 


69 


1419 


A 


1107 


2 


"466 


"FDTARLHEFGTSrrqiFAVDNREDLQKWMEA 
FWQHFFDI^QWKHCCfiELMIUlMMoriau'ri.r 
LTKEATSVYHDMSIDSPMKLESLTDnQKKIEE 
TNGQFLIGQREESLP/SS/CXJPHSLMVnKWSS 
RKRY/SYPASEPLHDBKGKKRQAPLPPSDK 


70 


1420 


A 


nil 


698 


23 


ALRRLHYVRATKVXrLor KKri* WtuiliriLiiUVJXi 

SNTDRPSRMIFYPPPREGALLLASYTWSDAAA 

AFAGLSREEALRLALDDVAALHGPWRQLW 

DGTGWKRWAEDQHSQGGFWQPPALWQT 

EKDDWTVPYGRIYFAGEHTAYPHGWVETAV 

KSALRAAIKINSRKGPASDTASPEGHASDMEG 

QGHVHGVASSPSHDLAKEEGSHPPVQGQLSL 

ONTTHTRTSH • 


71 


1421 


A 


1119 


2 


385 


QKQTLQNGYLDSSltolLYLGSLPPELQVSSDfc 
PPGPPEQAGLSQFHLEPETQNPETTEEIQSSO-Q 

A A A /NT Tv^f ncmnzi CCTVAWAPAI P^O^l- 
QEAAAQLPQLPbW JiLoo i RAuiAr nuroKioij 

EGVHSSTEQKAPAQQLPAFEEn-APLLIHHE 


72 


1422 


A 


1127 


1 


906 


HAQYVGPYRLEKTLGKGQTGLVKLGVHCIT 

GQKVAKTVNREKLSESVLMKVEREIAILVRLI 

ElffHVLKLHGVYENKKYFPPDELTSGPSMLA 

QVSraOKLSARRSWDLLSGFPRYLVLEHVSG 

GELFDYLVKKGRLTPKEARKFFRQIVSALDFC 

HSYSICHRDUa*ENIXUDEra4NlRIADFGMAS 

LQVGDSLLETSCGSPHYACPEVIKGEKYDGR 

D An\4wcp/3VTT UAT 1 VGALPPODDNLROLLE 

KVKRGVFHMPHFIPPDCQSLLRGMIEVEPEKR 

LSLEOIOKHPWYLGONFIS 


73 


1423 


A 


1128 


I 


802 


' LRNALDVLHREVPRYLVNLVDFLNFl'lMRgV 
FLGNPDKCPVQQA/MLEPLGSKTETLDLRAE 
KvTPirr'PTnrjPPlTLRTPRNSNYTYPIKPAIENWG 
SDFLCTBWKASNSVPTSVHQLRPADIKWAA 
LGDSLTTAVGARPNNSSDLPTSWRGLSWSIG 
GDGNLETHTILPNILKKFNPYLLGFSTSTWEG 
TAGLNVAAEGARARDMPAQAWDLVERMKN 
SPDINIXKDWKLVTLFIGGNDLCHYCENPEA 
HLATEYVQHIQQALDILSE 


74 


1424 


A 


1139 


60 


480 


■ ' FREPCLLVPGDHQPLREASWLA/LPPIGLWCj 1 
DSPLCCVEVAIPCNKGAHSVGLKGWLLAQG 
VLGMRDTIPQEHPWESTPDLCFCRDPEEIEVE 
EQPAADAAVAKGEF/QGEQIAPVPAUIAAHPE 
AADPAPVHTTAHPKGA 


75 


1425 


A 


1147 


2 


413 


■ -pFPHQHPQEPNKGSCWPQSALROQCPOFVLOV 
TTTSDLCSLQVPVSSHRNPLLDLAAYDQEGR 



140 



wo 01/57188 



PCTAJSOl/03800 



SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
pq)tidc 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucieouQc 
location 
corresponding 
to last amino 
acid residue 

01 pcpiIOw 

sequence 


Amino acid sequence (A^Alanine OCysteine, 
]>Aspartic Acid, EBClutamic Acid, 
F-Phenylalanine, G^lycinCi H-Histidinp, 
I=lsoleucinc, K^Lysinc, L=Leucine, 
M=Methionine, N==Asparagmc, INPioline, 
QKjlutamine, R-Arginine, SaScrine, 
T^Threonine, V=Valine, W«Tryptophan, 
Y«Tyrosine, X=Unknown, ♦*6top codon, 
/^possible nucleotide deletion, V^ossiblc 
nucleotide insation 














RFDNFSSLSIQWESI'RPVLASIEPELPMQLVSQ 

DDESGQKKLHGLQAILVHEASGTTAITATAT 

GYQESHLSSAR 


76 


1426 


A 


1155 


38 


410 


PnSAPAQDDPDLLSFIHCLHANLLCVWRRDVK 
PDCKEIWIFWWGDEPNLV\VQYIMNCMLWK 
KDSGKMAFPMNVGRDFFKEIHNIXERCLMD 
KNFVLIGKWFVRPYYKDEKPVNKSEH^ 


77 


1427 


A 


1162 


526 


350 


"Wqgledvstyfvliheusrgwseeelqgv 

LRGNLLRVFROVEKVQEENKWQSPLED 


78 


1428 


A 


1171 




1293 


maesasppsssaaapaaepgvtteqpgprsff 
ssppgleepldgadphvphpdlapiaffclrqt 

TSPIWWCIKMVCNr Wrfc*^VblVlL.YJJ-^ J J-t 

gmyqpcddmdclsdrcke.qyfddfififfa 
memvlkmvalgifgkkcylgdtwnrldffi 
\t^gmveysijdlqnini^amtvrvua»lka 
inrvpsmril\^lldtlpmlgnvlllcffvf 

FIFGIIOVl^L W AOLfL«l\riivv^rJ^DOi^r i i\^\jm^ ▼ 

pp\yyqpeeddempficslsgdngimgcheipp 
cvnwnryynvcrtgsanphkgainfdnigy 

AWIVIFQVniJEGWVEIMYyVMDAHSFYNFI 

yfilliivsvrepgllggsfstaqspkcqgdsfp 


79 


1429 


A 


1175 


I 


405 


PNDFFKDMFPDLPGGPLGPnCAENDYGAYLN 
FLSATHLGGLFPPWPLVEERBCUCPKASQQCPI 

chkyimgagklprhmrthtgekfymcnce 
vrftrqdkuahmrkhtgerfylcihcnakf 

vhnydlknhmr 


80 


1430 


A 


1182 


25 


198 


emnelsqqlsqqggrgasqcpsppaptlpnft 
plcololorvntglptppchpoagaa 

vmrr Twrn A nrmTl QTPP AO AG ARRVYAVEAS 


81 


1431 


A 


1186 


254 


583 


K. 1 V LJL> V (jAvj 1 VJULiOir \^e\\llvjr\ni\ ^ irxt im-im 

aiwqqarewrfngledrvhvlpgpvetvel 
peqvdaivsewmgygllhesmlssvlhartk 
vvkdggfflpxsselfm 


82 


1432 


A 


1187 


2 


716 


nrvnAATiTaT PT RSTKSPAEPSKSVPSLESDPRA 

ssqglpsqgpvqnqgrrqeqrpkkfnviqht 
ssfeksdsleqpsglegedkplaqfpspppaph 
grsahslqpklvrqpniqvpeilvteepdrpd 
tepeppfkepekteefqwpqgsqtlaqfpvek 
lppkkkrlglakmaqssgessfessvplfrsp 
sqesnvslsgssrsalferddhokaeapspsf 
■ dmgpkplgthmltv 


83 


1433 


A 


1188 


517 


804 


■ espglskvlrtgafaypflfdnlplfyrlouj 
wgrghgcgoealstshgyhlfcalltgflfa 
shlperlapgrfdyighshqlfhicavlgthf 

0 


Sid 


1434 


A 


1192 


45 


476 


" LGDVGFWVERTPVHEAAQRGESLQLQQLlfcS 
GACVNQVTVDSrrPLHAASLQGQARCvgLLL 

aagaqvdarnidgstplceclrlgqhrvcba 
lavlrgqgqpspvhsvpparglhxrefrmc* 
gflfdvgxnleahefhfoep 


85 


1435 


A 


1194 


69 


410 


■ ■ KRSEEASAPPFPLGGTGAAPTRASLPEgiLLrK 

sclearksqpdekllsalhnsrtwn»eprrsq 

HRLVSPEVHPGRRGSSPGVAECKLTSAYFRT 
GRSPCPSLPGTTRTOSLL 


86 


1436 


A 


1215 


3 


405 


- IpsHTCGNPGRLPNGIQQOSTFNLGDKVKy SO 

nlgffleghavltchagsensatwdfplpsc 

RADDACGGTLRG/AEWHHLOPPLPLG/ATKN 
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SEQID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQID 
KO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino add 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


T^ino acid sequence (A^Alanine OCysteine^ 
D^Aspartic Add, EFOlutamic Acid, 
F«4^ylalaniae, OOlydne, H^Histidine, 
I=Isoleudne» K=Lysine, LFLeucine, 
M^Methionine, N=Asparaginc, P=Proline, 
Q^GIutaminc, R^Arginine, S^Scrinc, 
T=Threomne, V=Valine» W«Tiyptophan, 
Y«Tyrosine, X«Unknown, ♦'^top codon, 
/possible nucleotide deletion. V=7iossible 
nucleotide insertion 














NADCTWTIU^EU5DTlALVFlDFQlJEDGYDnr 
EVTGTEGSSLW 


87 


1437 


A 


1216 


226 


964 


GTARFGPMVGFGANRRAGRLPSLVLGVLLV 
VIWLAFNYWSISSRHVLLQEEVAELQGQVQ 
■DTPVAPOTJT PK^RMSDI FAWGHAOETDRPEG 
GRLRPPQQPAAGQRGPREEMVEDDKVKLQNN 
ISYQMADIHHLKEQLAELRQEFLRQEDQLQD 
YRKNNTYLVKKLEYESFQCGQQMKELRAQH 
EENIKKLADQFLEEQKQETQKIQSNDGKELDI . 
NNOWPKNIPKVAENVADKNEEPSSNHIPHG 


88 


1438 


A 


1218 


1 


534 


PEFGTTISCGYLMATDVSRRPSVHKAVEIEQJB 
RVKSAGAWUHPYSDFRFYWDLIMLIXMVGN 
LrV^PVGITFFKEEN brVr Wl V r JN VLtou i rrisLiU 
LVLNFRTGIWEEGAEILLAPRAIRTRYLRTW 
FLVDLISSIPVDYIFLVVELEPRLDAEVYKTAR 
ALRIVRFTKILSLLRL 


"89 


1439 


A 


1223 


1 


743 


MGFDEVFMINLRRRQlJKJKisKlWLJiAJuVAy^ 

CRLVEAVDGKVGMLTRSNAAPGRHLAMLET 

LVWAPRFVDADNLILNPDTLSIXIAENKTVV 

APMLDSRAAYSNFWCGMTSQGYYKRTPAYI 

PIRKRDRRGCFAVPMVHSTFLIDLRKAASRNL 

VAFYPPHPDYTWSFDDUVFAFSaCQXAEVQMY 

VCNKEEYGFLPVPLRAHSTLQDEAESFMHVQ 

LEVMVPSSPSSAQSMAWSADHIGLVISYL 


9b . 


1440 


A 


1227 


2 


349 


"NKTSFJFYLKNrWADUMTLTFPFRlVHDAGF 
GPWDFKFILCRYTSVLFYANMDTSIWLGLrr/ 
YDRY/WKVVRHL/WDSWMTGI/SFTRVYLLG 
LGARLVWFGKLILAKGGHGGISWL 


91 


1441 


A 


1245 


3 


1937 


LGSSDVRAPQRSELGAESPSRMVASQAYNLT 

SALTPILTRSRVLNEEPLTLAGFNSRAPANLSD 

WQLIFLVDSNPFPFGYISNYTVSTKVASMAF 

QTQAGAQIPIBRLASERATTVKVFNNSDWAAR 

GHRSSANSWVQPQAFVGAWTLDSSNPAAV 

LHLQLNYTLLDGRYLSEEPEPYLAVYLHSEPR 

PNEHNCSASRRIRPESLQGADHRPYTEFISPGT 

MJPVGSYRLNLSSHFRWSALEVSVGLYTSLC 

QYFSEEDWWRTEOLLPLEETSPRQAVCLTR 

HI ^TAr Cli QL*r V rraiUKJ V rrc»r l v iAyiTA*-« 

TCAVCXVTYMVMAAILHKLDQLDASRGRAIP 

FCGQRGRFKYEILVKTGWGRGSGrrAHVGIM 

LYGVDSRSGHRHLDGDRAFHRNSLDIFQIATP 

HSLGSMWKIRVWHDNKGLSPAWFLQHnVRD 

LQTARSTFFLVNDWLSVETEANGGLVEKEVL 

A A QV A siPB VPTPSVAALLRFRRLLVAELQRGF 

FDKHimSIWDRPPRSCFTRIQRATCCVLLICL 

FLGANAVWYGAVGDSAYSTGRVSRLNPLSV 

DTVAVGLVSSVWYPVYLAILFLFRMSRSKV 

GWGWGPGSTGNGAWASAPCPEPPLSSAAAR 

GKGVHORLLGKGQHT 


92 


1442 


A 


1246 


5 


562 


■ "VFDEENlLNELNDPLR^VNFNCRKLVATMr 
LFANADPNmAMLSKLRFEVFQPGDYIIREG 
AVGKKMYFIQHGVAGVTTKSSKEMKLTDGS 
YFGEICLLTKGRRTASVRADTYCRLYSLSVD 
NFNEVLEEYPMMRRAFETVAIDRLDRIGKKN 
SILLOKFOKDLNTOVFNNOENEILKQIVKH 


93 


1443 


A 


1249 


180 


901 


■ -tVppppggpspaplhpkrsptstgeaelkeerl 

PGRKASCSTAGSGSRGLPP\SSPMVSSAHNPN 

KAEIPERIUa)STSTPNNU»PSMMTRRNTYVCT 

ERPGAERPSLLPNGKENSSGTPRVPPASPSSHS 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 . 


Predicted 
beginning 

niiclMitidA 

location 
correspondi 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
locatioa 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine OCysteine, 
D='Aspaitic Acid, E=61utaniic Acid, 
F^Phcnylalanine, GKilycine, H=Histidine, 
Msoleucinc, K==Lysine, L=Leucine, 
M=Methionine. N=Asparagine, P=Proline, 
Q=<7lutamine, R^Arginine, S»Serine, 
T^Threonine, V=Valine, W=Tiyptophan, 
Y=Tyrosine, X=Unknown, *«Stop codon, 
/possible nucleotide deletion, \^ossibIe 
nucleotide insertion 














LAPPSGERSRLARGSTKSTFHGGQVRDRRAQ 
CjGGGGGVQNGPPASPTLAHEAAPLPAGRPRP 
TimFTKLTSKLTWlVADEPERIGGPEVTRRP 
RQEDHLSPGGRGCSEL 


94 


1444 


A 


1261 


3 


385 


KFSQWGLTKPKI^ASP/VWSLVKKLMKKWS 

VTQNLTFREQLEAGIRYFDLRVSSKPGDADQ 

EIYFIHGLFGIKVWDGLMEIDSFLTQHPQEIIFL 

DF^raFYA^!DETHHKCLVLRlQEAFGNKLCPA 

CR 


95 


1445 


A 


1282 


2 


550 


GPRDNPG\EDPRFEIVEHF01AWFrFELVARi'A 

VAPDFLKFFKNAU^OJDLMSIVPFYnLVV^ 

VVESmLANLGRVAQVLRU^RIFRILKLARH 

STGLRSLGATLKYSYKEVGaLLYLSVGISIFS 

WAYTIEKEEN\BGLATIPACWWWATVSNfrT 

VQYGDWPGTTAGKLTASACILA 


96 


1446 


A 


1294 


1 


1456 


QLLPPSNRENAGLLVGRCLCSAALRPVGDUT 
SSGQVAVRNAPQAGSAKAOKGKFQDNFEFIQ 
YFKKFFDANCNEKDYNPVAAGQGQETEVAP 
SIVAPVLNKPNQCPEGYICVKAGRNPNYGYT 

AETmOT/LV/LVILLGSLYLVTLILAVA^AMA 

YEEQNQATLEEABQKEAEFQQMLEQLKKQQ 

EAAQQAATATASEHSREPSAAGRLSDSSSEAS 

KLSSKSAKERRNRRKKRKQKEQSGGEEKDED 

EFQKSESEDSIRRKGFRFSIEC3NRLTYEKRYSS 

PHQSLLSIRGSLFSPRRNSRTSLFSFRGRAKDV 

GSENDFADDEHS1FEDNESRRDSLFVPRRHGE 

RRNSNLSQTSRSSRMLAVFPANGKMHSTVDC 

NGVVSLVGGPSVPTSPVGQLLPEVnDKPATD 

DNGTTTETEMRKRRSSSFHVSMDFLEDPSQR 

QRAMSUSILTNTVE 


97 


1447 


A 


1295 


2 


2057 


IQTQLPTKSSQQLRKGGNCVRCKMQMNFIAE 

EVLLKYRrrFyNNNKGPNMLYIEIKAFVHFMI 

NRYLSYGSGPKRFPLVDVLQYALEFASSKPV 

CTSPVDDIDASSPPSGSIPSQTLPSTTEQQGALS 

SELPSTSPSSVAAISSRSVIHKPFTQSRIPPDLP 

MHPAPRHITEEELSVLESCLHRWRTEIENDTR 

DLQESISRIHRTDELMYSDKSMIQVPYRLHAV 

LVHEGQANAGHYWAYIFDHRESRWMKYNDI 

AVTKSSWEELVRDSFGGYRNASAYCLMYIN 

DKAQFLIQE\DLIKTGQPLVGIETLPPDLRDFV 

EEDNQRFEKELEEWDAQLAQKALQEKLLAS 

QKLRESETSVTTAQAAGDPKYLEQPSRSDFSK 

HIJKEETIQnTKASHEHEDKSPETVLQSAIKLE 

YARLVKLAQEDTPPETDYRLHHWVYFIQNQ 

APKKIIEKTLLEQFGDRNLSFDERCHNIMKVA 

QAKLEMIKPEEVNLEEYEEWHQDYRKFRETT 

MYLDGLENFQRESYIDSLLFLICAYQNNKELL 

SKGLYRGHDEELISHYRRECLLKLNEQAAELF 

ESGEDREVNNGLIIMNEFIVPFLPLLLVDEMEE 

KDILAVEDMRNRWCSYLGQEMEPHLQEKLT 

DFLPKLLDCSMEIKSFHEPPKLPSYSTHELCER 

FARIMLSLSRTPADGR 


98 


1448 


A 


1304 


118 


453 


SGPSSRAIYLHRKEYSQNLTSEPTLLQHRVEH 
LMTCKQGSQRVQGPEDALQKLFEMDAHGRV 
WSQDLILQVRDGWLQLLDIETKEELDSYRLD 
SIQAMNVALNTCSYNSDLS 


99 


1449 


A 


1306 


3 


1660 


" CGYFCHTTCAPQAPPCPVPPDLLRTALGVHPb 
TGTGTAYEGFLSVPRPSGVRRGWQRVFAALS 
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SEQID i 
NO: of 1 
nucl- I 
eotide s 
seq- \ 
uence 


JEQID ! 
AO: of I 
>eptide 
;eq- 
lence 


vlet S 
iod I 
i 
1 

1 


SEQ 

DNO: \ 
n 1 
JSSN 
)9/496 

m 


Predicted 1 1 
beginning i 
ducleotide 
location 
correspondi 1 
ng to first 
amino acid i 
residue of 
peptide 1 
sequence 


Predicted end 1 
nucleotide 
location 
corresponding 
to last amino | 
acid residue 
of peptide 
sequence 


^ino acid sequence (A*=Alanine C>=Uysteine, 
D^Aspartic Acid, &<jliitamic Acid, 
'^Phenylalanine, 0=Glycinc, H=Histidinc, 
l=Isoleucine, K=Lysine, L=Leucinc, 
M=Methionine. N=Asparaginc, P=Prolinc, 
Q=Glutanunc, R=Argbine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X^Unknown, *=Stop codon, 
/t=possible nucleotide deletion, V=possible 
nucleotide insertion 














bSRLLLFDAPDLRLSPPSGALLQVLDLRDPQh 
SATPVLASDVIHAQSRDLPRIFRVTTSQLAVPP 
TTCTVLLLAESEGERERWLQVLGELQRLLLD 
ARPIU^RPVYTLKEAYDNGLPLLPHTLCAAILD 

QDRLALG 1 KcvjLr V IrUUKoiNLFir v v ▼ v< 

QLTLSPSAGLLWLCGRGPSVRLFALAELENI 

EVNEVPKIPESRGCQVLAAGSILQARTPVLCVA 

VKRQVLCYQLGPGPGPWQRRIRELQAPATVQ 

SLGLLGDRLCVGAAGGFALYPLLNEAAPLAL 

GAGLVPEELPPSRGGLGEALGAVELSLSEFLL 

LFTTAGIYVDGAGRKSRGHELLWPAAPMGW 

GYAAPYLTVFSENSIDVFDVRRABWVQTVPL 

tj-i^wtnm xrocnCT UT V/TTPTTVllT TYLRNOLAE 

KDEFDIPDLTDNSRRQLFRTKSKRRFHFRVSE 
EQQKQQRREMLKDPFVnRSKLISPPTNFNHLV 

HVQPANORPGARDKSP 


100 


1450 


A 


1318 


918 


190 


""SLCVPGPVDTGTFAVMSVMVGSVTESLAPgA 
LNDSMINETARDAARVQVASTLSVLVGLFQV 
GLGLIHFGFWTYLSEPLVRGYTTAAAVQVF 
VSQLKYVFGLHLSSHSGPLSLIYTVLEVCWKL 
POSKVGTWTAAVAGWLVWKLLNDKLQQ 
QLPMPIPGELLTLKjA 1 ulo i umuurjixu- cj^vix 
PPVAPNTQLFSKLVGSAFTIAVVGFAIAISLXjK 

ifalrhgyrvdsnqvwvmrdv 


Tol 




■"A 


1353 


"SB 


"355 ^ 


"OWPOLFTYPLIGSPKCl-l^SARPENRMYKKlVK 
SSHGNHALQEVLPRSGHGTEFTKQKHLEAAD 

HOHPPARMblrDK ^ 

AHLLMLNLALVrDLLXYLTSLPfLm^^ry 

WIFGDFMCKFIRFSFHFNLySSim,TCFSIFRY 

CVUHPMSCFSIHKTRCAWACAWWnSLVA 

XOPNITFUTSTNRTNRSACIDLTSSDELNTIKW 

<i/vTT IT T AU I rr PT VTVTf PYTTTIHTLTHGHAN 

\nsCLKOKARRLmLL 




"1452 


"A 


"1363 


^542 


|2 ^ 


103 


1453 


A 


1371 


2 


410 


rCHSTESSSDFILPGDYLLGGLCa>LHSGCLgv\i. 
SFl^mHGYHLFQAMRLAVEEINNSTALLPNITL 
GYQLYDVCSDSANVYATUIVLSLPGQHHIEL 
r\nr^j i tivcPTV! AVIGPDSTNRAATTAALLSP 
FLVPMLLEO 


104 


1454 


A 


1376 


3 


432 


TNSRVEDRS/NMSLWTQNITVCPVKNV IRDGG 
FGPWSPWQPCEHLDGDNSGSCLCRARSCDSP 
RPRCGGLDCLGPAIHIANCSRNGAWTPWSSW 
A I rcT^irnTGFOVRORSCSNPAPRHGGRlCVG 
KSREERFCNENTPCPVPIF 


105 


1455 


A 


1379 


2 


396 


fiT r,T 1 YT TFAAVEGVMRVIGGSNHLAWLDD 
niAVmSIFVWFIFISlAQTMKTLRLRKNTVCT 
SLYRHFKNTLIFAVLASIVFMGWTrKTFRIAK 
CQSDWMERWVDDAFWSFLFVSLILIVIMFLW 

RPSA 


106 


1456 


A 


1383 


1 


432 


EDGHGGWS5RCL Y unABlsijHiaii' WKKLU vv 
ORGGHEIRFAFYFPGHPLLSPQICLAPETPPRG 
CPPVSSLHFISLQ/RLPRDCX}ELFQVGERQSGL 
FEIQPQGSPPFLVNCKMTSGTFWTCRTDSRVF 
ONANPSNAAHSEDQPTP 


107 


1457 


A 


1386 


719 


558 


T#FVTRSHSVAQAKCSO V>TAHRSLDLVCj5an 
YPALSLOSSWDHRHTWUFAFL 


108 


" "1458 


A 


1397 


631 


2 


TRVAlSLLCAAltlS^MVQSAGKRWPiOVMLM 
VWLFAFLYSWPIQALLPTYLKTDLAYNPHT 
VANVLSFSGFGAAVGOCV/GGFLGDWLGTRK 
AYVCSU^OUJIPWAIOGAimrVLGIXLF 
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SEQID 


SHQID 


NO: of 


NO: of 


nucl- 


peptide 


eotide 


seq- 


seq- 


uence 


uence 





Met 
hod 



SEQ 1 


Predicted 


ED NO: 


oeginning 


in 


nucleotide 


USSN 


location 


09/496 


correspondi 


914 


ngtofiist 




amino acid 




residue of 




peptide 
sequence 



Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 



Amino acid sequence (A*»Alanine OOystcinc, 
I>»Aspartic Acid, EKjlutamic Acid, 
F=Phenylalanine, G=Glycine, H=Hislidine, 
Msoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P==Proline, 
Q=Glutanune, R=Arginine, S=Serine, 
T-Threonine, V=Vaiine, W-Tiyptophan, 
Y«=Oryrosinc, X=Unknown. ♦^Stop codon, 
/^possible nucleotide deletion, >Fpossible 
nucleotide insertion ^ 

Fqqmlgqgiagilpkliggyfdtdqraaulu 



109 



1459 



no 



1460 



T?5r 



FTYNVGALGGALAPnOALIAQRLDLGTALAS 
LSFSLTFWILRNRRPGKSLVR 



1402 



15 



387 



VLVALPDWTSETVVTEYLGHRVTLPCLYSS 
WSHNSNSMCWGKDQCPYSGCKEALIRTDGM 
RVTSRKSAKYRLQGTIPRGDVSLTILNPSESDS 
GVYCCRIEVPGWFNDVKINVRLNLQRASTT 



1421 



350 



HEDLSSLLTRGSGNQERERQLKKLlSUa>WM 
LAELAFPVGVLATCA*SLLSC*YCVILFPCSCF 
FFHSPDALFSLLLLSCYFPSYCFFYYLFFSSSPL 
CLLLASSPFPLFILLASL 



1426 



344 



FTSTMTKPFEKESEQPA*ATLAFGAQTSTTAD 
QCALKPDLSYLNNSSSSSSTPATSAGGGIFGSS 
TSSSNPPVATFVFGQSSDPVSSYGFVNTAESST 

SDSLLFSQDS KLATTS 

— RSCT*SGASSGPGWTPRTrWWR 



112 



113 



114 



1462 A 



1434 



46 



372 



1463 



1464 



115 



116 



117 



TJST 



1466 



1467 



TTSWTTSCT 

SRRSSQRTCSRACSGAWSRTW*RSS*TSSSSC 
STSCSSSSSRSCGRPGGPLGARGVHTTSCLNSC 
MSSSTTSSTTSTF 



1439 



292 



HEDIMTHYDRLVDE*ALNAGKQRYEKMiSCi 
MYLGEIVRNILIDFTKKGFLLRGQISEMLKTR 
niFLTFLLSNFLIVCVLLFYVSFYLFQSCINFVL 



1463 



396 



1464 



291 



1465 



667 



1479 



337 



381 



KQQAVPEPHSSTTTPQEQEQNWYGQDLLNLQ 
QRTKVHLPGHKTGPAVAKDTPEPVKKEFTVP 

atsqgp*spfseepplppsneevpptlpp*epqs 
edp*kna*lk:qmhaatthwqqhqqhqvgc 

QYHGIMQ ^ ^ 

AGSYPSMVWSCHWGVrQKRRAL*VYS^mi 



GRRKCGQYWPLEKDSRIRFGFLTVSNLGVEN 
MNHYKKSTLEDLNPEVNPGFFFLTLWKQGEN 

NYCN 



LPPQRPA*TDSYSTCNVSSGFLAGQSHNU1H^ 



YWTKYQVWEWLQHFLDTNQLDANCIPFQEF 
DINGEHLCSMSLQEFTRAAGTAGQLLYSNLQ 
HLKWNGDSLFLCLSLPC 



GTSGGPKRVLVTERFPWQNPLPVNRGQAQR 
VLGPSNSFQRVPLQAQKLVSSHKPGQNQKHK 
QLQATSVPHPVCMPLNNTQKSKQPLPSAPEN 
NPEEELASDPNNEESL*RPWALEDFEIGRPLG 

KGK 



118 



1468 



1485 



385 



TYLWL^GNPPFYEKNDGGLFELlLRAKDlil^NS 
PYWDDMSDSAKHFIRPLTGRDP*KPFPCDQPL 
QHPWIEGHTCLDNNIHQAASEPINNNFAESKR 
NLAFLATGWRHMRKLFMGANLEGPGPTVS 

GTTSKHH*LARSURGPFDHDLKPNAATRDgL 



119 



1469 



1486 



120 



1470 



1497 



398 



999 



NIIVSYPPTKQLTYEEQDLGWKFRYYLTNQE 
KALTKFLKWVNWDLPQEAKQALELLGKWK 
PMDVKDSLELLSSHYTNPTVRRYAVARLRQA 

DDEDLLMYL 



MGESPAV^GYFVLAGMNSAGLSFGGGAGKY 
LAEWMVHGYPSENVWELDLKRFGALQSSRT 
FLRHRVMEVMPLMYDLKVPHWDFQTGRQL 
RTSPLYDRLDAQGARWMEKHGFERPKYFVP 
PDKDLLALEQSKTFYKPDWFDIVESEVKCCK 
EAVCVlDMSSFTEFErrSTODQALEVLQYLFS 
NDLDVPVGHIVHTGMLNEGGGYENDCSIARL 
NKRSFFMJSPTDQQVHCWAWLKKHMPKDSN 
I T A .RDVTWKYTALNUOPRAVDVLSBLSYAP 
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SEQID i 
NO: of 1 
nud- I 
eodde i 
seq- » 
uence 


JEQID. I 
^0: of I 
)eptide 

icnce 


lod 1 

i 
1 
{ 


SEQ 1 
DNO: \ 
n 1 
USSN 
39/496 
?14 


Predicted ] 
jcginning i 
[luclcotide 
ocation 
correspondi 
ag to first 
amino acid 
residue of 
peptide 
sequence 


rSulClvU ClIU 

nucleotide 

location 

corresponding 

■r% lact nmino 

add residue 
of peptide 

SCl|UwIIvv 


Amino acid sequence (A^Alanine |C«K;ysteme, 
D-Aspartic Add, EKxlutamic Acid, 
F«Phenylalanine, G=Glydne, H=Histidin^ 
I=IsoIeudne, K=Lysine. LpLeucine, 
M=M^ionine, N=Asparagine, P=Proline, 
Q=Glutamine, RpArginine, S=Serine, 
T^Threonine, V«Valine, W=Tryptophan. 
Y^Tyrosinc, X^^nknown, ♦«Stop codon, 
/=possibIe nucleotide deletion, V=possibIe 
nucleotide insertion 














l^PDHFPSLFCKEMSVGYANGIRVMSMTHi 
GEPGFMLYIPIEYRWGFTMLSTLVSNS 


121 


1471 


A 


1498 


3 


306 


AQFLLVGWDHIL*LIVL'TNLTBLGRTrCDQN 

WPNSPDVLNHGCFYMQCLSKDCnGYVSRE 

MLVAHTHTVEEHTGTHLQYVSWPDHSVPDD 

SSDFVEFEN 


122 


1472 


A 


1533 


121 


329 


"LGLFSFVWTEVLEEPKDFSCETEDFKTLHCT 
WDPGTDTALOWSKQFSQSYTLFES*VGSGYn 

DNFFLA 


123 


1473 


A 


1547 


111 


408 


■DARTTWKPRNGSSGIWPGDGAK*PPAVEQAIi 
RGHVEMIEKLTFLNLHTSEKDKGGNTALHLA 
AKHGHSPAVQVLLAQWQDINEMNEKQQTPL 

HVAADRG 


124 


1474 


A 


1555 




745 


"MTFDDDDKNTYGVALVWKKFQTQSLRLSUb 

HRKSHLWRGIVSITLIEGRDLKAMDSNGLSDP 

YVKFRLGHQKYKSKIMPKTLNPQWREQFDF 

HLYEERGGNODITAWDKDAGKRDDFIGRCQV 

DLSALSREQTHKLbLC^l-iiDuc.vxriLf v v i i 

ASATVSISDLSVNSLEDQKEREEILKRYSPLRI 

FHNLKDVGFLQVKVIRAEGLMAADVTGKSD 

PFCVVELNNDRLLTHTVYKNLNPEWNKVFTL 

♦VALVWKKFQTQSLRLSDLHRKSHLWRGIVS 

ror'nTM AK>rncunT QriPYVTCFRLGHOKY 
ITXrIEGRDLrLAML'oiNVJJL/OL/r X vrki'*vi*v»*Av*^* 

KSKIMPKTLNPQWREQFDFHLYEERGGVIDIT 

A u/nvn A rsvpriDFTHR roVDLSALSREOTHK 

LELQLEEGEGHLVLLVTLTASATVSISDLSVN 

SLEDQKEREEILKRYSPLRIFHNLKDVGFLQV 

KVIRAEGLMAADVTGKSDPFCVVELNNDRLL 

THTVYKNLNPEWNKVFTL 


125 


1475 


A 


1556 


57 


509 


GGPAPNSRYAEP*KNSLAMT*AHADCENYVA 

r^nm "mOTPQTVMT KTRFGNVRVSRELPGHTGY 

lsccrflddsqivtssgdttcalwdietaqqt 
ttftghsgdvmslslspdmrtfvsgacdass 

VT wniRnfiMrROSFTGHVSDINAVS 


12^ 


i47^ 


A 


1592 


3 


178 


KSEKSCVSSLAHFGTSCQRDYDAMVJO- Vt i h 
EMLPTCDLADOHNIKFHYAFALNR*ER 


127 


1477 


A 


1612 


1 


497 


' ■ TESPLLVRPYLPYITKSELHAIMTAOre IIAUS 
VLGAYISFGVPSSHLLTASVMSAPASLAAAKL 
Ti\xn>PTPVfPKlTLKNAMKMESGDSGNLL*AAT 
QGASSSISLVANIAVNLIAFLALLSFMNSALA 
WVGNMFDYPQLSFELICSYIFMPFSFMMGVE 

WPDSFM 


128 


1478 


A 


1619 


286 


■"486 


' "CCMNSKAQESVFKNVLCNPPALSEMPUVKA 
EDEVDFRASSISEEVAVGSIAATLKMKQGPM 


129 


1479 


A 


1627 


I 


395 


- Ttogalrywifgrflcmwaavdyrcctaji 
mglciisidryvgvsyplryptivtqrrglma 

LLCVWALSLVlYlGPLLGWRHPAPEDEnCQI 
NEEPGYVLFSTrGSFiLPLAlMLVMri ivv xi^v 
AKTE 


130 


1480 


A 


1638 


2 


"TSS 


- "dprvRTKIVNRKTTIYEIQDKTGSMAV VUlfLO 
ECHNIPCEKGDKLRLFCFRLRKRENMSKLMS 
EMHSFIQIQKNTNQRSHDSRSMALPQEQSQHP 
KPSEASTTLPESHLKTPQMPPmPSSSSFTKVT 
KDKDIK*LLFNLYSSVEILPEVLHLKT 


131 


1481 


A 


""1651 


607 


3 


- ■ LAEGGDVFDCVLNGGPLPESRAKALhKgMVC 
AIRYCHGCGVAHRDLKCENALLQGFNLKLTD 
FGFAKVLPKSHRELSQTFCGSTAYAAPEVLQ 
GIPHDSKKGDVWSMQWLYVMLCASLPFDD 
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1 SEQlD 1 
NO: of 1 
nucl- ] 
eotide 

1 

uenoe 


SEQ ID I 
HO: of 1 
peptide 
seq- 
uence 


Vdet i 
lod ] 

1 

1 


SEQ ] 

[DNO: 

n 

USSN 
D9/496 
914 


Predicted 

beginning 

nucleotide 

location 

conespondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to iast amino 
acid residue 
of peptide . 
sequence 


Amino acid sequence (A«Alanine OCysteia^ 

rksaAcnnrHc Acid. TWrliitamtC Acid. 

F=Phenylalanine, G=Giycine, H=Histidine, 
I=Isoleucine, K=Lysine, L=Leudne, 
ivyTsMpthtnnine. N^AsDarasine. P^Prolinei 
Q=Glutamine, R=Arginine, S=Serine, 
T«Thiconine, V=Valine, W«Tryptophan, 
Y«Tyrosine, X«Unknown, *'=^top codon, 
/=possible nucleotide deletion, V=possible 

nucicouuw liuuuuii 














TDEPKMLWQQQKGVSFP'iHLSISADCQDLLK 

RLLEPDMILRPSIEEVSWHPWLAST**KQWQV 

LSNKVGGESKPKKKK 


132 


1482 


A 


1656 


150 


48 


"LVAKSLLYCGOJTLLQLAKNVGNNSFNDIM 
EANLTSPSPKFIPSSDM*mrY»TYFGAWHV 

VDAO 


133 


1483 


A 


1660 


3 


406 


"RKHIKLL1QKLSDVP*ECQNNQL*KLTEICEKE 
KKEFKKK^lDDQRPEKJrEA*SKDKSPMEEEK 
TEMIRS YIQE VGR iIKKLlsJiAl^o kiu^ki-kjuv 
HKEIRQPILDEJCPKGEGSSSFLSETCHEDTSWF 

PNFTP 


134 


1484 


A 


1666 


1276 


466 


PGSTHASARITIY*L*nLSNATEVDNNFSKPPP 

FFPAGAPPASSSSSSSSSSPPTVSTAPPLIPPPGF 

PPPPG APPPSLU' I IBSGHo ovj I UaKo AKAT r i \j 

NVAFPHLPGSAPSWPSLVDTSKQWDYYARSS 

SSSSSSSSSSSSSPRDRDRER^RTRERERERDHS 

PTPSVFNSDEERYRYREYAERGYERHRASRE 

KEERHRERRHREKEETRHKSSRSNSRRRHESE 

EGDSHRRHKHKKSKRSKEOKEAOSEPAPEQE 

STEATPAE 


135 


1485 


A 


1673 




417 


■ PTRPVNSSQAFALVyYTLGALGGNLIAHMUL 
GYRYWAGIGVLQSCESALTHYRLVANHVAS 
DISLTGGSWQRIRLPDEVENPGMNSGMLQE 
DLIQYYQFLAEKGDVQAQVGLGQLHLHGGR 
GV*ONHQRAFDYFNLAA 


136 


1486 


A 


1678 


525 


9 


"IJ^TSLSSAAVSAVSPPPCKI STATTLPPPMl'Sh 
FCVFPSPSMSPSPSEFLSCIASVSRVHSLSSSSS 
GSSSTASSLNFSAIMGSSSATASWVLSTASTPP 
CPSALPSSPAQES*SLAASSSAWPVAGISPSGA 
CTFPAGSASGAAKAPSPSWRCPSFRALFSLLD 

SSSLSL 


137 


1487 


A 


1680 


1 


2999 


AHRDElQRKFDALRNSa-VITDLEEQLNQLTE 

DNAELNNQNFYLSKQLDEASGANDEIVQLRS 

EVDHLRRErrEREMQLTSQKQTMEALKTTCT 

MLEEQVMDLEALNDELLEKERQWEAWRSVL 

GDEKSQFECRVRELQRMLDTEKQSRARADQ 

RJTIESRQVVELAVKEHKAEILALQQALKEQK 

LKAESLSDKU^LEKKHAMLEMNARSLQQK 

LETERELKQRLLEEQAKLQQQMDLQKNHffR 

LTOGLQEALDRADLLKTERSDLEYQLENIQV 

LYSHEKVKMEGTISQQTKLIDFLQAKMDQPA 

KKKKVPLQYNELKLALEKEKARCAELEEALQ 

KTRIELRSAREEAAHRKATDHPHPSTPATARQ 

QIAMSAIVRSPEHQPSAMSLLAPPSSRRKESSr 

PEEFSRRLKERMHHNIPHRFNVGLNMRATKC 

AVCLDTVHFGRQASKCLECQVMCHPKCSTC 

LPATCGLPAEYVTHFTEAFCRDKMNSPGLQT 

KEPSSSLHLEGWMKVPRNNKRGQQGWDRK 

YIVLEGSKVLIYDNEAREAGQRPVEEFEUXP 

DGDVSIHGAVGASELANTAKADVPYILKMES 

HPHTTCWPGRTLYLLAPSFPDKQRWVTALES 

WAGGRVSREKAEADAKLLGNSLLKLEGDD 

RLDMNCTLPFSDQWLVGTEEGLYALNVLK 

NSLTHVPGIGAVFQIYIIKDLEKLLMIAGEERA 

LCLVDVKKVKQSLAQSHLPAQPDISPNIFEAV 

KGCHLFGAGKIENGLCICAAMPSKWILRYN 

ENLSKYCIRKEIETSEPCSCIHFTNYSILIGTNK 

FYECDMKQYTLEEFLDKNDHSLAPAVFAASS 

NSFPVSIVOVNSAGQREEYUXFHEFGVFVDS _ 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence ' 


SEQ ID 
KG: of 
peptide 
seq- 
uence 


Met 
Kod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A^Alanine OGysteine, 
I>Aspartic Acid, fiFOlutamic Acid, 
F^Phenylalanine, G^tycine, H=Histidine, 
Msolcucinc, Rei^yslne, LpLeucinc, 
M=Methionine, N^Asparaginc, P=Proline, 
QsGlutamine, R^Aiig^ne, ^Serine, 
T-Threonine, V«Valine» W-^iyptophan, 
Y»Tyn)sine, X«Unknown, *«Stop codon, 
/^possible nucleotide deletion, V=possible 
nucleotide insertion 












• 


ygrrsrtddlkwsrlplafaVrepylfvw 
nslevieiqarssagtparayldipnprylgpa 

r<;<3rTATYt ASSYODKLRVICCKGNLVKESGTE 
HHRGPSTSRR*PASPLPQYQGQRAFLQGRRK 


138 


1488 


A 


1686 


2 


526 


GRPQGPAPGAGSPPESGPGLWAALGCSLVWV 

PLCCLGGAAGRL*ARSGKSGLRRRRAHAGPP 

orT" DrTvrcrTi* P<s APF^lfiORGPLPGPGTGGVCS 

CWTRGCQTTARTAAAAAAPGPAORRPPOQA 

PQNGSCAASASQEAAAPPPMCPPORRWAVAS 

PPETRCPAAPGTRCRRLEAA 


139 


1489 


A 


1693 


3 


376 


FIETKALGCAWFSLCYYLVLYFESSHKVDFVF 
IV*CFSTPPGAQMTIMSQACAERCNIMRLVDR 
RWAGIAKGVGTQKnGRVHLQEQKALGL 


140 


1490 


A 


1704 


3 


376 


ERTNKFIKELIMDGKNLIAATKSLSVAQRKFA 
HSLRDFKFEFICjL/AV IJJUcrKl^lUAoi-Kiirorirx* 
KNLEEQREIMVS*EGCKLISQLSRGKKIWIWK 
LVLVEVVKHLSLGTVVHCNGKMRFPEP 


141 


1491 


A 


1743 


1 


362 


LITNKVFVARELSCLDYHLDSTGSTAVVADQ 
DKLELELVLKGSYEDTQTSFLGTASAFRFHY 

PLRPLTIVKEVTMDVPAPNVRGLNWMG 


142 


1492 


A 


1769 


1 


406 


NNPSTLPRGS^PMSPRTTMGRRRQRRREHKSS 
LSLASSTVGPGGQIVHTErTEVVLCODPLSOF 
GLQLQGGIFATETLSSPPLVCFIEPDSPAERCG 

T f r\\mT\Ty\n CTMi^T ATTJTVwTMPRANOIJLRDA 
LLXJVuDKVLolNUlA 1 mAJiiviEJDru^ V*''*****''* 

ALAHKW 


143 


1493 


A 


1789 


1 


447 


QMLRNGGDQNTVPDYHFADRlkELL-Fl buy 
KNCIP*DTYLRPSALGNIVEEVTHPCSPGPCPA 
NELCEVNRKGCTSGDPCLPYFCVQGCKLGQA 
or\-i?i At>r\r"n Tr\\/P<!QAnPVFPYKlCSCGOSGL 
LENCMEMHCMDLPTDTSALVR 


144 


1494 


A 


1814 


1 


404 


PGRRFRPRLSQAGTDSGS*VFPDSFPSAFAbFL 
Dvm nT:pnr)AYTVltNKPVELRCRAFPATOrYF 
KCNGEWVSQNDHVTQEOLDEATGLRVREVH 
lEYSRQQVEELFGLEDYWCQCVAWSSAGTTK 

srrayvri 


145 


1495 


A 


1827 


26 


448 


"XVEEKJHADTWRSXCLSDFFFHAAKXLCXK'l^ 
nnriATQT <;vanHFGKGNGLTWAEKFOCEGSE 
THLALCPIVQHPEDTCIHSREVGWCSRYTDV 
RLVNGKSQCDGQVEINVLGHWGSLCDTHWD 
PEDARVLCRQLNCGTAL 


146 


1496 


A 


1828 


574 


333 


" QHEGGDLRRRQLGEIQLTVRYVCLRAASAU' 
SMAAET*HHVPASGADPYVRVYLLPERKWA 
PKKKT^JVKRKTLEPLFDBT 


147 


1497 


A 


1855 


1 


372 


■ ERLVLTSEHCLVLTLFWPSWTYHTLLLSRQH 
VRRLPKLTHAEHDHLASIMNKLLTNYDNLFE 
TSVTYSMG*HGAPTGSEAGANWNH**LHAH 
YYPPLLRSDTVRKFMVGSQMLAQAQRDLTPE 

0 


148 


1498 


A 


1879 


568 


7 


LLSALDDKGGTQPSASFSNAFnVCVTACFAU 

lAHTYMAAEYLEKAGRKLGVNVYVEKQGAN 

GIEGRLTADQLNSATACIFAAEVAIKESERFN 

GIPALSVPVAEPIRHAEALMQQALTLKRSDET 

RTVQQDTQPVKSVKTHJCQALLSGISFAVPU 

VAGGTOVA*AV*RQGISSLHDVOVRTWNS 


149 


1499 


A 


1880 


611 


24 


GLNSENALSNEAMERGWQCLRLFAERLQUU' 
PSQIRWATATLRLAVNAGDFIAKAQEILGCP 
VQVISGEEEARUYQGVAHTTGGADQRLVVD 
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seOid s 

NO: of > 
nucl- F 
eotide s 
seq- I 
uenoe 


10: of h 

leptide 

eq- 

lencc 


Act 1 1: 
od [ 
i 
I 
C 

1 ( 


.EQ I 
DNO: I 

D i 

JSSN 1 
)9/496 ( 
)14 


^dieted ] 
^egiiuimg 1 
lucleotide 
ocation 
correspondi 
Qg to first 
omino acid 
residue of 
peptide 
sequence 


Predicted end i 
nucleotide 
ocation 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


^m\no acid sequence (A»Alanine OCysteine, 
>Aspartic Acid, EKjIutamic Acid, 
?sdPhenv1alaniQe G*=Glyclne, H^Histidine, 
[=lsoleucine, K^Lysine, U=Leuc!ne, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Aiginine, S=Serine, 
T=Threonine, V=Valine, W=Ttyptophan, 
y«Tyrosine, X^Unknown, *=Stop codon, 
/«=possible nucleotide deletion, V^ossiblc 
nucleotide insertion 














irn A <ITPT VTGTGAOTPLFSLSMGCVTWLER 
YFADRNLGQENFDAAQKAAREVLRPVADEL 
RYHSWKEVRGASVTVQALQEIMMAQQMDE 
RTTMEIWPVD 


130 


1500 


A 


1894 


2 


750 


GRVDFFHTDYRPLIRDSNNYVLDEQTQQAPH 

LMPPPFLVDVDGNPHPTKYQRLVPGRENSAD 

EHLIPQLGYVATSDGEVIEQnSLQTNDNDERS 

PESS ILlXjMIRQLQQQ»s;Dt^KMoAiA^i^ i irt^u 

LSNGEETPRRGFRRLSLDIQSPPNIGLRRSOQV 

EGVRQMHQNAPRSQIATERDLQAWKRRVW 

PEVPLGIFRKLEDFRLEKGEEERNLYnGRKRK 

TLOLSHKSDSVGLVSQSRPRTCRRKYF 


151 


1501 


A 


1900 


141 


785 


GKTlQlQnMQNKykl vQKQiKlli'KlNKKA 

MEMQIKKQFQDTCKVQTKQYKALKNHQLEV 

TPKNEHKTILKTLKDEQTRKLAmAEQYEQSI 

NEMMASQALRLDEAQEAfcUv^ALiu-vi-WJ=J^* 
ELLNAYQSKIKMQTEAQHERELQKLEQRVSL 
RRAHLEQKIEEELAALQKERSERKNLLERQE 
RF.IETFDMESLRMGFGNLVTLDFPKEDYR 


152 


1502 


A 


1915 


2 


377 


1 VRi 1 DTORDGLONYEALLGLTNLSGRSDKl^ 
RQKIFKERALPDIENYMFENHDQLRQAATEC 
MCNMVLHKEVQERFLADGNDRLKLWLLCG 
EDDDKVQNAAAGAUVMLTAAHKKUXKMT 

OVTT 


153 


1503 


A 


1921 


1 


237 


"AYQSLRLEYLQIPPVSRAYITACVLTSAAV^^L 
ELITPFQLYFIPELIFKHFQIWRLITNFLFFVPFG 
FNFLLYMIFLYT 


154 


j 1504 


A 


1928 


2 


354 


EMVEGGEGKMCINTEWGGFGDNGCIDUIKI K 

YDTEVDEGSLNPGKQRYEKMTSGMYLGEIV 

RQDLIDLTKQGLLFRGQISERLRTRGIFETKFLS 

OTKSDRLALLQVRRILQQLGLD - 

"TEIAKIKMEAKKKYEKEL'rMFQNDFEKA^jgA 






"A 


rT929 


IT" 


"369 


KSEALVLREKSTLERIHKHQEIETKEIYAQRQ 
LLLKDMDLLRGREAELKQRVEAFESYQLELK 

■ VTRKLPIFIVDAFTARAFRGSPAADCLLIiNtiL 


[156 


p506 


■■"A 


1935 


"T 


■"270 


DEDMHQKIAREMNLSETAFffiKLHPTDNFAQ 
RSCFGLIWFTPTTDLQILTSSILPSIL 
■ ESKVNNEKFRTKSPKPAESPQSATKQLD^^KIA 




1507 


""A 


1936 




'loT " 


A vt: \rvm a mju wpif nnNTICGTMFDFFTHMH 
NKKHTOGQFQKSSDFQKEELQOTFLPPERQG 
" "TTHRLNVTAEPPCTSMPIY WMFDVPHRt;! lA 


]l58 


1508 


"a" 


1939 




■"S3 


NTCPVDLTDYCAQNGFYCLVYGFLPYGSLED 
RLHCQTQACPPLSWPQRLDILLGTARAIQFLH . 
nncDCi mnniK^SNVLLDERLTPKLGDFGLA 
RFSRFAGSSPIOSSM 


159 


1509 


A 


1974 


3 


401 


■ "HTSTARLLLHRGAGKEAVTSDOYTALJHLAAK 
NGHLATVKLLVEEKADVLARGPLNQTALHL 

,u\. ii <« m A T>i rnr*T i/i xCfxi^f CAT 

AAAHGHSEVVEELVSADVIDUrDEQCjl^oALn 
UWVQGRHAQTVETLLRHGAHINLQSLKFQGG 

HGPAATLLR 


160 


1510 


A 


1982 


2 


417 


" KFLKDLEKQYNKEEPHLSHlOSCFLQNQbUl'A 
PfSEYCNNHPGACLELANLMKQGKYRHFFEA 
CRLLQQMIDIAIDGFLLTPVQKICKYPLQLAEL 
LKYTTQEHGDYSNKAAYEAMKNVACLINER 
KRKLESIDKIA 


161 


1511 


A 


1984 


4 


770 


" "rETGSVSLSPSGLEGAESYA VSPILYSSFDVJUi 
LWLETLQGQRHSHTGVKSTPGQSAAELMKLR 
SSHNASKTLNANNMETLIECOSEGDIKEHPIX 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A-Alanine OCysteine, 
D=Aspartic Acid, EKJlutamic Acid, 
F=Plienylalanine, G=<jlycine, H=Histidin^ 
I^Isoleucine, K^Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, V=Valine, W=Ttyptophan, 
Y*=Tyrosine, X^'Unknown, *«=Stop codon, 
/t=possibic nucleotide deletion, \ppossiblc 
nucleotide ins^ion 














ASCESEDSICQLIJSVKJUCKK.VLSWl'i'Liviiuu.a 
PASDFSGALETDLKASLFDQPLSnCGDSDTLP 
RPIQDILTILCLKGPSTEOIFRRAANEKARKEL 
KEELNSGDAVDLHU.PVHLLAWFKDFLRSIP 
RKLLSSDLFEEWMGALEMQDEEDRIEALK 


162 


1S12 


A 


19S6 


864 


501 


LLNSGLFSAPDGSNLEMRLTRGGNMCSGRIEI 
KFQGRWGTVCDDNFNIDHASVICRQLECGSA 
VSFSGSSNFGEGSGPIWFDDLICNGNESALWN 
CKHQGWOKHNCDHAEDAGVICSSKD 


163 


1513 


A 


2001 


419 


187 


AVDLSIDESSL TGEl I'PCSKVTAPQPAATNGD 
LASRSNIAFMGTLVRCOKAKGWIGTGENSE 
FGDIINLSTFWHS 


164 


1514 


A 


2012 


284 


597 


■ SLLCLFPGTSTWCKPIVIETQLYV1VAQLFC3U 
SHrraU>SFANKFIKIQAlEILKIRKPNDIETFKl 
ENNWYFWADSSKAGFTTIYKWERETGFYSH 
QSFTR 


165 


1515 


A 


2013 


2 


403 


EDPEELGHFYDYPMALFSTFELFLTIIDGP AN Y 
NVDLPFMySrrYAAFAUATXLMLNLLIAMMG 
DTOWRVAHERDELWRAQIYATTVMLERKLP 
RCLWPRSGICGREYGLGDRWILRVEDRQDLN 
RORIORYA 


166 


1516 


A 


2019 


2 


927 


CCQREGLGLKAWQILLSHGRNGLPGEPASS 

QGLSAASSTPVFHLALQIDSAPDNIDWVEMLF 

NKNMVTERLQNVMVLEQCFSDSSSLYRFLTY 

SYIXAFNVWLLLAPVTLCYDWQVGSIPLVETI 

WDNIRNLATIFLAVVMALLSLHCLAAFKRLE 

HKEVLVGLLFLVFPFIPASNLFFRVGFWAER 

VLYMPSMGYCILFVHGLSKLCTWLNRCGATT 

LIVSTVLLLLLFSWKTVKQNEIWLSRESLFRS 

GVQTLPHNAKVHYNYANFLKDQOR>®aEAIY 

HYRTALNNNKAWDYLCWRFRKTLTDLP 


167 


1517 


A 


2025 


696 


71 


AAASAASSLTVTLGRLASACSHSILRPSGPGA 

ASLWSASRRFNSQSTSYLPGYVPKTSLSSPPW 

PEWLPDPVEETRHHAEVVKKVNEMIVTGQY 

GRLFAWHFASRQWKVTSEDLIUGNELDLA 

CGERIRLEKVLLVOADNFTLLOKPLLGKDLV 

RVBATVIEKTESWPRIIMRFRKRKNFKKKRIV 

TTPOTVLRINSIEIAPCLL 


168 


151S 


A 


2046 


2 


366 


HLQVAARVFMPLQAVDSAPKPLKGQAQAPg 
RLQGAARVFMPLQAQVKAKASKPLQMQIKA 
PPRLRRAARVLMPLQAQVRAPRLLQVQSQVS 
KKQQAQTQTSEPQDLDQVPEEFQGQDQVLR 


169 


1519 


A 


2049 


1 


945 


QNLEDREVLNGVQTELLTSPRTKDTLSDMIH 

TVEISGEGGPLGIHWPFFSSLSGRILGLFIRGI 

EDNSRSKJIEGLFHENBCIVKINNVDLVDKITA 

QAQDVFRQAMKSPSVLLHVLPPQNREQYEKS 

VIGSLNIFGNNDGVLKnCVPPPVHGKSGLKTA 

NLTGTDSPETDASASLQQNKSPRVPRLGGKPS 

SPSLSPLMGFGSNKNAKKJKIDLKKGPEGLOT 

TVVTRDSSIHGPGPIFVKNILPKGAAIKDGRLQ 

SGDRILEVNGRDVTGRTQEELVAMLRSTKQG 

ETASLVIAROEGHFLPRELVMFRSQSH 


170 


1520 


A 


2050 


363 


1 


■ PVATHLTiaLNSDEHAVVISSAKTLCETVKDi' 
VAKVEKTYDKTLENAWADAVASKCSVLNE 
KLEQLLQALHTDSQAAPVLPGLSPLIVEEDAV 
ESSSEESLGESKEQLGDDVTKPSSOKA 


171 


1521 


A 


2055 


139 


675 


' IPSRPWLORITGLDPAOPLFNQKPHQDRLUPS 
bAQFVDVIHSDTDALOYKEPLGNIDFYPNGG 
LDQPGa*KmGGFQYFKCDHQRSVYLYLSSL 



150 



wo 01/57188 



PCT/USOl/03800 



SEQID 
NO: of 
nucl- 
eotide 
seq* 
ueace 



SEQID I Met 
NO: of I hod 
peptide 
seq- 
uence 



SEQ 


Predicted 


ID NO: 


beginning 


in 


nucleotide 


USSN 


location 


09/496 


correspondi 


914 


ng to first 




amino acid 




residue of 




peptide 




sequence 



Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 



Amino acid sequence (A^Alanine C==CystBme, 
D-Aspartic Acid, E=Glutamic Acid. 
F=Phenylalanine, G=Giycine, H=Histidine, 
I=Isol«iclne, K=Lysiae, L==Leucine, 
M^Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T«Thrconine, V=Valine, W'=»Tryptophan, 
Y^Tyrosine, X=Unknown, ♦"Stop codon, 
y^ossible nucleotide deletion, \=possible 
nucleotide insertion 



RESCTITAYPCDSYQDYRNGKCVSCCirsgKJti 
SCPLLGYYADNWKDHLRGKDPPMTKAFFDT 
AEESPFCMYtfYFVDirrWNKNVR 



172 



1522 TA 



2056 



361 



173 



T523 TA 



2060 



174 



1524 TA 



2071 



74 



LIQHKSAVEYAQSHLSLVSMCKESHKCSKFK 

mewkvkjrsdgtryttkrpvrdrilkeralki 
keersglttdddtmsemkmgrywskeerkq 
hlvrgkeqri^uiefmmrirlkclkes 



387 



443 



"GTRILSMQIPFVOFQPIRTSliHMAAAaVFALL 
QAYAFLQYLRDRLTKQEFQTLFFLGVSLAAG 
AVFLSVIYLTYTGYIAPWSGRFYSLWDTGYA 
KMPIIASVSEHQPTTWVSFFFDLHILGCIWA 



LLMGPKAKKSGSKKKKVTKAERLKLLQEEEb 
RRLKEEEEARLKYEKEEMERUEIQRIEKEKV/ 
HRLEAKDLERRNEELEELYLLERCFPEAEKLK 
ngTKLL SOWKHYIQCDGSPDPSVAQEMNT 
AALTWSQPQEFWPMEMQPIVTDMVTVHWV 



175 



T525 TA 



2083 



139 



4ir 



176 



1526 A 



2092 



587 



AESSTVGWLCALFRVTHVGVGATGHGWCG 
RRYLCGLPLPSPAPMPIMSLPEGESRKEREVQ 
RT^OFP YLEPGHELPATTLLAFLAAV 
EGSVNFKFGVLFAKDGQLlDDEMFSNElUSfcl' 
FQKFLNLLGDTITLKGWTGYRGGLDTKNDTT 
GIHSVYTVYQGHEIMFHVSTMLPYSKENKQQ 
VERKRMGNDIVTIVFQEGEESSPAFKPSMIRS 
HFTHIFALVRYNQQNDNYRLKIFSEESVPLFG 
PPLPTPPVFTDHQEFRDFLLVKLINGEKATLET 

PCI 



177 



1527 TA [2103 



44 



427 



GKGQVSLEGRPHRGPLCLGSWWPGSRVPUU 
CDGAWLAWACWVFGNDFPSPASAACSALLG 
CSVSTACLCVPLCSGSPLAPFRRTAALQEGLR 
RAVSVPLTLAETVASLWPALQELARCGNLAC 

RSDLQ 



178 



1528 TA I 2104 



4or 



ALQSTLGAVWLGLLLNSLWKVAESKDQVFQ 



179 



"1529 TA rirn i 



312 



1530 TA I 2116 



PSTAASSEGAWEDFCNHSVSNAYNFFWYLHF 
PGCAPRLLVKGSKPSQQGRYNMTYERFSSSL 
LILQVREADAAVYYCAVEVPNTDKLIFGTGT 

RLOVFPNIQNPD 

PTRSSTRPPSLFVHASAKGGEKEEGDUUH Y L 

MRTESOTGLKKGGNANLVFMLKRNTEPKKG 

SYHFDLERLRAAHELFEREQEHLAPGGISMPL 

PPPLPLPACLG 



366 



TSIKRAIE'rrDVTRSFGWDSSEAWQQHDVge 
LCRVMFDALEQKWKQTEQADLINELYQGKL 
KDYVRSLECGYEGWRIDTYLDIPLVIRPYGSS 
OAFASWCTFHLTACVSLHRIHNSTVV 



181 



T531 I A I 2117 



386 



YGLGAHFGRLFIQAGINENDFYUUAWl;AGR 
NDLQQWIEVDARRLTRFTGVITQGRNSLWLS 
DWVTSYKVMVSNDSHTWVTGKNGSGDMIFE 
GNSEKEBPVLNELPVPMVARYIRINPQSWFDN 

GSICI 



182 



1532 TA 12123 



493 



RTKTDVYILNLAVADLLLLFrLPFWAVNAVM 
GWVLGKIMCKITSALYTLNFVSGMQFLACISI 
DRYVAVTKVPSQSGVGKPCWnCFCVWMAAI 
LLSIPQLWYTVNDNARCIPIFPRYLGTSMKAL 
IQMLEiaOFVVPFLIMGVCYnrARm^DCMP 

Nnas .. . 

TVICCLLAFClGLlFVg 



183 



1533 Ta tiro 



561 



■RQAWHEAFKVRiOSIi. 
RSGNYFVTMFDDYSATULLIVVILENIAVCF 

WOTDKFT^LKDMLGFAPSRYYYYMWKYI 
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wo 01/57188 



PCTAJSOl/03800 



SEQID 

NO: of 

nucl- 
i eotide 

seq- 
I uencc 



SEQID 
NO: of 
peptide 
seq- 
uence 



Met 
hod 



184 



1534 



185 



1535 



116 nssr 



T87 11537 



SEQ I Predicted 
ID NO: I beginning 
nucleotide 
USSN I location 
09/496 coirespondi 
914 Ingtofiist 
amino acid 
residue of 
peptide 
sequence 



Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 



2145 I 3 



538 



IS n538 A 



189 n539 



190 I 1540 



191 n541 



1^ I 1543 



194 11544 



2151 2 



671 



2153 I 2 



400 



2158 [227 



2167 13 



486 



2168 2 



412 



2T79 TM 



399 



2190 [T 



469 



Amino acU sequence (A=Aianine OCystcine, 
I>Aspartic Acid, E=Glutamic Acid, 
F=Phcnylalanine, Q=Glyc!nc H=Histidinc, 
Msoleucine, K=Lysine, L^Uucine, 
M=Methionine, N=Asparagine, P^Proline, 
Q=Glutaniine. R=Arginine, S=Serine, 
T=Threonine, V^Valine, W=Tryptophan, 
Y«Ty«>sine, X^Unknown, *=Stop codon, 
/^possible nucleotide deletion, V=possible 
nucleotide insertion 



2197 126 



2236 12 



157 
"IS" 



llSl I 105 



409 



672 



"SPLMLLSLUASWNMGLSPPGYNAWUKAS 
EEFLSYPTWGLAVCASLDVFAILPVPVAFIGR 
RFSMDDGAGPFCSAAYTTTGCRTPYI^ 



HELTVAAADRGQPPQSS VYPVrVT VLD VND 
NPPVFTRASYRVTVPEDTPVGAELLHVEASD 
ADPGPHGLVRFTVSSGDPSGLFH-DESSGTUl 
LAHALDCETQARHQLWQAADPAGAHFALA 
PVTIEVQDVNDHGPAFPLNLLSTSVAENQPPG 
TT vm .H AIDGDAGAFGRLRYHL 



LDKLLDRMHN YNIFNEYILICQ VAATYIKLCxV/ 
PKNNFNGSLVQASYQHEELRREXOMLACSFG 
NKHCHQQASTLISDWISSNRNRIPLNVRDIVY 
CTGVSLIX^EDVWEFIWMKFHSTTAVSEKKIL 
LEALTCSDDRNLLNRLLNLSLNSEWLDQDAl 
DVIIHVARNPHGRDLAWKFFRDKWKILNTRI 
RQKTLEFDFAEPLILAFPIILYTAIDNPPLVREH 



GPMCDKHSAFAEKFHAGFIUYIVHPLWE-IWA 
HLALPDAQDILYTLEDNRNWVDSMIPQSPSPP 
LDEQNRDWQGLLENLHVELTLDEEDSEGPEK 
EGEGQTYFTSSKTLCGXVPQNTOSLGETGIHIC 

AHDKSP 



FNCFRVASDSFLENSSLLlMlLPLRNATQlihUK 



PGAVAYTCNPSTLGGWGGWITRSGVRDQPG 
QHGGTPS 



AHLGGAWLTQRSLGSWAAPGPARAAKliVYA | 
CIPQNQKMNIWRMKTSKHLQLLSFVLGAVSP 
AVVVPYMMVLQENGYGVEEGIPTLLMAASS 
MDDILATTGFNTCLSIVFSSGCARSSGSRNSKS 
LRTPLGTICEGCDDSSIFSHLDHSSKWSSTYG 

HSGA 



EFLSSNQlTQLPN'n'FKPMPNLRSVDLSYNJU. 



OALAPDLFHGLRKLTTLHMRANAIQFVPV^ 
ODCRSLKFLDIGYNQIJCSLARNSFAGLFKLTE 
LHLEHNDLVKVNFAHFPKLISUISLCXRRNKV 

AIWSSLDW 



Mm^QKfUlE^^ 
l^/QCADELLRWTTSEPLTLEHEYAMQRTWLED 

AYECTFIVLDAEKRHAQPGATEESCavIVGDVN 
1 FT .TDLEDLTLGEEVLIAEP 



CLDRAAGIRHERNVIYINFrHTRHRGWLAKK 
LSY\T.FIQERDVHKGMFATWrENVLNSSRV 
QEAIAEVAAELNPDGSAQQQSKAVNKVKKK 
AKRILQEMVATVSPAMIRLTGWVLLKLFNSF 
FWNIQIHKGQLEMVKAATETNLPIXFLPVHR 

gjl 

PSKXGGIRLLL'rGTQLYGRI''USAIAPLGDLUK 



DGYNGE GREEPY 

EYFPNSlWRSLFSTMDLGUlGFYTyRlLQALS 



YTHSKGIMHRDVKPLNILCNSPRNKVILADW 
GLAEFYHPMRKYSVHVATRYYKSPEILLDYE 
YYDYSLDIWAVGVILLELLTLKLHVFEGGDN 

EQ 



RKGVGKMPrSEGRPGgERSUW VTSYKVMUS 
NDSHTWVTVKNGSGDMIFEGNSEKEIPVLNE 
LPWMGARYIRINPQSWFDNGSICMRMEILGC 

PLPDPNNY . 

MGVASDWTKRIEYQPGSGSMPLFPSIHL&rCD 

AVSSLQIVTELOTNYIGKGCDRETYSEKSLQ 
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wo 01/57188 



PCTAJSOl/03800 



SEQlD b 
NO; of > 
nucl- p 
eotide s 
seq- I 
ueace 


EQID 1 N 
10: of h 
eptide 
eq- 
tence 


let 1 S 
od H 
1 i 
I 

C 
S 


EQ 1 P 
DNO: b 
n n 
JSSN 1 
►9/496 c 
►14 I 
t 
] 
1 

1 


redicted I 
^ginning r 
ucleotide 1 
ocation ( 
orrespondi t 
Igtofirst ! 
imino acid < 
esidue of 
[>eptide 
sequence 


>redicted end J 
lucIeotidc 
ocation 
x)rresponding 
0 last amino 
icid residue 
of peptide 
sequence 


fijnino acid sequence (A^Alanme OCystcine, 
CNAspartic Acid, E-Glutamic Acid, 
F=Phenylalanine, G=G!ycine, H=Histidme, 
NIsoleucine, K=Lysine, L=Leucine, 
W^Metnioninc, * 
Q=Glutamine, R=Arginine, S=Serine, 
T«Threonine, V^Valine, W-Tiyptophan, 
Y=Tyrosinc, X«Unknown, *'=Stop codon, 
/=possible nucleotide deletion, \Fpossiblc 
nucleotide insertion 














KLCGASSGIlDLLPSPSAAJNWTAGLLVDSSlj 
MIFKFDGRQGAKIPDGIVPKNLTDQFnTMW 
MKHGPSPGVRAEKFTILCYSDKTEMNRHHY 
ALYVHNCRLVFLLRKDFDQADTFRPAEFHW 
KLDQQAl^VTCQPGKSrrRQLQEMPVTIQG 

ISLKPS 


196 


1546 


A 


2256 


1 


396 


niGTPVSGLTNRDTLAVllWFREPIRLKTVKi' 
GKVINKDLRHYLSLQFQKGSIDHKLQQVIRD 
NLYLRTIPCTTRAPRPGEVPOVDYNnSVEQF 
KALEESGALLESGTYDGNFYGTPKPPAEPSPF 

UtrUr V 


197 


1547 


A 


2259 


43 


154 


OLAIEIGVRALUOVFVFTEFLOPFgKVlQPbiil 

WLYKNPLGQSDNIPTRLMFAISFLTPLAVICV 

VKIIRRTDKTEIKEAFLAVSLALALNOVCTNTI 

KLIVGRPRPDFFYRCFPIXjYMNbbMHt^ i vjur 

DLVSEGRKSFPSmSSFAFSGLGFTTFYLAGKL 

HrFTESGRGKSWRLCAAILPL 


198 


"1548 


A 


2275 


3 


404 


^CT^WVIPRMLVDFLSbSKTISLPECAly^a^♦l' 
FLGFASNNCFIMAAMSYDRYTAIHNPLQYHT 
T x>mjKTn OMMMASWMVGFLFSLCIIVTVFN 
LSLCDLNTIQHYFCDISPWSLACNYTFYHEM 

AIFVLSA ^ =— 

LTOMh'KlHALSAIESTILLAMAFDRY V AWJhPL 


T99 


1549 


"A 


"23T5 


T 


"375 


lUlAAVLNNTVTAQIGIVAVVRGSLFFFPLPLLI 
KRLAFCHSNVLSHSYCVHQDVMKLAYADTL 
PNWYGLTAILLVMGXDRMFISLb i r 


200 


1550 


A 


2334 


2 


409 

"512 " 


TR\nU'OQRKMSFFFK IbLGEKLVTJKJ^Lhlil Ul- 
SDDPMLPSPDQLKKKAPFTNKKLKAHQTPVD 
ILKQKAHQLASMQVQAYNGGNANPRPANNE 
EEEDEEDBYDYDYESLSDDNILEDRPENKSCH 

dolqfeykeem 

isweaqiaehqwysdekuargylqalaskm 


"20l 


1551 


"A 


tlBo 


Tl 




TEELEALRSSSLGSRTLDPLWKVRRSQKLDM 

sarlelqsaleaeirakqlvqeelrkvkdan 

LTLESKLKDSEAKNRELLEEMEILKKKMEEK 

fradtgklmlcdsalfeykyfsnecfyflfd 

LIVTLEAPTEFQIQY 


202 


1552 


A 


2351 


1 


"ToB 


" TSSYSSDELSPGEPLl SFP WAPLGAFliRl'linLi. 
NRVLERLAGGATRDSAASDILLDDIVLTHSLF 
LPTEKFLQELHQYFVRAGGMEGPEGLGRKQA . 
CLAMLLHFLDTYQGLLQEEEGAGHmCDLYL 
T TKnrT\T?oT vnnT ppnTT RT HOT.VETVELKIPE 
ENQPPSKQVKPLFRHFRRIDSCLQTRVAFRGS 
DEIFCRVYMPDHSYVmSRLSASVQDILGSV 
rrnvt rw/QTi-cvAaxivn^i TLVAVSSSGEKVLLQ 
PTEDCVFTALGINSHLFACTRDSYEALVPLPE 
EIQVSPGDTEIHRVEPEDVANHLTAFHWELFR 
rVHELEFVDYYFHGE 


203 


1553 


A 


2361 


2 


403 


■ NNLNCAEPLFEQNNSLNVNl'N JigiOCl V WLIH 
GYRPVGSffLWLQNFVRiaNEEDMNVIVVD 
WSRGATTFrYNRAVKNTRKVAVSLSVHnO^ 
LKHGASLDNFHFIGGSLGAHISOFVGKIFHGQ 

l.GRITGLDP 


204 


1554 


A 


2390 


280 


476 


- CPSLLPQCLMSLSDLSLSPAPPSHLSPRCFSI'V? 
AGSRLGAMRRCAREMDATPMPPAPSCPSERV 


205 


1555 


A 


2400 


543 


745 


AAVALRDISWQQPYPM0F Y AGSSLGPWri VN 
HGQDRRPHAPGRPARGKVQEGSARPPSAVAC 

EDCSCR 
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PCTAJSOl/03800 



SEQU^ i 
NO: of 1 
nucl- I 
eotide s 
seq- I 
uenoe 


;eqid ^ 

40: of t 
peptide 
cq- 
lence 


lOd I 
i 
1 
( 

1 I 


EQ 1 I 
DNO: I 
n I 
JSSN 1 
)9/496 ( 
?14 1 


'redicted J 
jeginning i 
mclcotide 
ocation 

ff^<»oi^ Attn 1 

Xjrrcsponui 
ig to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end i 
lucleotide 

coiresponding 
D last amino 
acid residue 
of peptide 
sequence 


\mino acid sequence CA=Alanine OCysteine. 
CHAspartic Acid, B-Glutamic Acid, 
'^Phenylalanine, Q^Glycine, H^Histidine, 
plsoleucine, K=4-ysine, D=Leucine» 
M«Methionine, N=Asparaginc P=Prolinc» 
Q=Glutamine, R«Arginine, S^Serine, 
T=Thrconine, V=Valine, W«Tiyptophan, 
Y==Tyiosine, X^Unknown, ♦=Stop codon, 
/'=^ossible nucleotide deletion, \=possiblc 
nucleotide insertion 


206 


SS6 


A 


2406 


122 


485 " 


DLSPDSREDHPQGHRRLLPKRPVRGSLMPUH 
THHPCPVSSTTNDTPDQIWVSVGSLRMGTGG 
MGANASTSPRCWDLSSGNKKWnQVPILASIV 
P<;Rr,nLLATGVGGMCACVPRNQPLTGT 


207 


1557 


A 


2409 


289 


418 


LWTLYWiKQQVQHNHSNRLSCRPSQEDRAT 
HTIMVIDKENTLS 


208 


1558 


A 


2413 


64 


492 


VOGTGXXFIAFTEAMTH1''PASPVWAOMF1''L 
\aiNLGLGSMIGTMAGlTTPTOTFKVPKEMFT 
GGCCVFAFLVGLLFVQRSGNYFVTMFDDYSA 
TT PT Tl mLKNIAVAWIYGTKKFMQELTEML 
GFRPYRFYFYMWKFVSP _ 


209 


1559 


A 


2417 


3 


877 


EKERLLDEWFTLDEVPKGiCLHLRLHWLlLMP 

NASNLDKVLTDDCADKDQANDOLSSALLILY 

LDSARNLPIRYKTNEPVWEENFTFFIHNPKRQ 

DLEVEVRDEQHQCPLGNLKVPLSQLLTSEDM 

TVSORFQLGNSGPNSTIKMKIALRVLHLEKRE 

RPPDHQHSAQVKRPSVSKEGRKTSDCSHMSO 

SPGPGGSOTAPSTPVIGGSDKPOMEEKAQPPE 

AGPQGLHDLGRSSSSLLASPGHISVKEPTPSIA 

SDISLPIATQELRQRLRQLENGTTLGQSPLGQI 

01 TTP 


210 


1560 


A 


2422 


35 


456 


-RmASDLEPFTPTUQPlSFbAITQPSClKRgKA 
AGNPGSLAATIDHKPCSAPLEPKIQASRNQRW 
rA\mL af<;t tdiaEPASPOVHETPIDASQTQK 
VEPASKSRFTPELQAKVSHSRERALSTMDATP 

HHAQPQRGEG 

RRYSQKLIQHTACQLLRTYPAA'nUDSSNlW 


"211 


"1565 


"A 


[2431 


hi " 


^64 


LMFWLHGIQLVALKYQTDDLPLHLNAAMFE 

AMnnCGYVLKPPVLWDKNCPNfYQKFSPLER 

DLDSMDPAVYSLTIVSGQNVCPSNSMGSPCIE 

VDVLGMPLDSCHFRTKPIHIWTLNPMWNEQF 

LFHVHFEDLVFLRFAWENNSSAVTAQRIIPL 

KALKRGYRHLQLRNLHNEVLEISSLFINSRRM 

EENSSGNTMSASSNmiTEERKCXQTHRVTVH 

GVPG 


212 


' 1562 


A 


2436 


1 


411 


GIRGITGHLGCPINDDPSLTLTVSWVMEUKfi 

YIONGTKKEDDSLTIFAVAKRDHVSDTCOAC 

TDLDHNLDKOYLTVLGEQATPTNRLGALPKG 

RANRTRDLELTYLAERIVRLTWIPGDANNRPI 

TnvnroTEEHO 


213 


15^ 


A 


2445 


1 


■ 1294 


■ T^isSlGCCWVSRSSQlDGLl^AHKiiGP^^ 
WLMPELHPKEQBLELLVLEQFLSILPEELQIWV 

QQHNPESGEESVnXEDLEREFDDPGQQVPAS 

PQGPAVPWKDLTCLRASQESTDIHLQPLKTQ 

LKSWKPCLSPKSDCENSETATKEOISEEKSQG 

LPQEPSFRGISEHESNLVWKQGSATGEKLRSP 

SQGGSFSQVIFTNKSLGKRDLYDEAERCLILT 

TDSIMCQKVPPEERPYRCDVCGHSPKQHSSLT 

i^u/^n>TtJT<^T?i<rPVK'n^nPOKAFSLRSYLIIHQR 

IHSGEKAYECSECGKAFNQSSALIRHRKIHTG 

EKACKCNECGKAFSQSSYLIIHQRIHTGEKPY 

EChmaJKTFSQSSKlJraQRIOTGERPYECNE 

CGKAFRQSSELrrHQRIHSGEKPYECSECGKA 

FST.SSNLIRHORIHSG 


214 


1564 


A 


2461 


1 


615 


' "^GSTISSSRNIFLEDDLAWQSLIHPDSSN iPL 
STRLVSVQEDAGKSPARNRSASITNLSLDRSG 
SPMVPSYEmSPQANRTYVRTETTEDEMCIL 
LDSVQLKDLWKiQCHHSSGMEFQDHRYWLR 
THPNCIVGKELVNWLIRNGHIATRAQAIAIGQ 
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wo 01/57188 



PCTAJSOl/03800 



SEQID I 
NO: of 1 
nucl- ] 
eotide 1 
seq- 
uence 


SEQID 1 
^0 : of 1 
[)eptide 
seq- 
uence 


Viet ! 
tiod ] 


3EQ 1 
[DNO: 1 
in 

USSN 
09/496 
914 


Predicted ] 

jegiiming 

Qucleotide 

ocation 

correspondi 

□g to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acld sBQuence ( A«Alanine CXIysteine, 
D-Aspartic Acid, E^lutamic Acid, 
F=PheDylalanine, G=Glycine, H=Histidin^ 
Msoleucine, K^Lysinc, L=Leuc!ne, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine» R=Arginine, S=Serine, 
T=Thieonine, V«Valine, W=Tryptophan, 
Y«Tyrosinc, X«Unknown, *«Stop codon, 
/=possible nucleotide deletion, \r=po5siblc 
nucleotide insertion 














AMVDGRWLDCVSHHDQLFRDEYALYRFLqv 
T.F55VYCOLECSKLIL 


215 


1565 


A 


2464 


3 


2932 


GPOVRSSQDGMADVFVHLRTAWPRCSFISOQ 

HGPGRHGRRVCSSQDSMADVFVHLRTAWPT 

CSLISGQHGPGESVSYEDDDIPAPASLLHVNA 

AAPALTNPTAPVLCTAPNNTAQKEKVPSGMR 

ORPAGVRISSRTPDLTCAVSTHSTVPGVRISSC 

TPDLTCAVSIHSTVPSVCISSCTPDLTCAVSTH 

STVPGVRISSCTPDLTCAVSTHSTVPGVRISSR 

TPDLTCAVSIHATVPGVRISSCTPDLTCAVSIH 

ATVPGVRISSCTPDLTCAVSTHSTVPGVRISSR 

TPDLTCAVSMSTVPGVRISSCTPDLTCAVSIH 

ATVPGVRISSCTPDLTCAVSTHSTVPGVRISSR 

TPDLTCAVSIHATVPGVRISSRTPDLTCAVSIH 

ATVPGVRISSCTPDLTCAVSIHATVPOVRISSC 

TPDLTCAVSIHATVPGVRlSSRTPDLTCAVSm 

ATVPGVRISSCTPDLTCAVSTHSTVPGVRISSR 

TPDLTCAVSIHATVPGVRISSCTPDLTCAVSTH 

STVPGVRISSRTPDLTCAVSIHATVPGVHISSC 

TPDLTCAVSTHSTVPGVRISSRTPDLTCAVSIH 

STVPOVCISSRTPDLTCAVSmSTvra vmao*- 1 

PDLTCAVSmSTVPGVRISSRTPDLTCAVSTHS 

TVPGVfflSSCHTDLTCAVSIHATVPGVfflSSCT 

PDLTCAVSTHTTVPGVRISSRTPDLTCAVSIHS 

TVPGVRISSCTPDLTCAVSTHSTVPGVRISSRT 

rr^j-i 1 TfOTniT T\rDr«\JT>TCQ"nTPnT TPAVSIHA 
PDLTC AVSTHL 1 V riiVKlooK l rJJL» i w/\ v pmrw 

TVPGVHISSCTPDLTCAVSIHATVPGVRISSRT 

PDLTCAVSIHATVPGVfflSSCTPDLTCAVSTHS 

TVPGVRISSRTPDLTCAVSIHSTVPGVHISSCT 

PDLTCAVSTHSTVPGVHISSCTPDLTCAVSTH 

oTnm/^\njTCCiJTPnT TTA^VSIHATVPSVHISSC 

TPOT.TCAVSIHSTVPOXXTSVSOTSTG 


216 


1566 


A 


2477 


1 


414 


' FRTKSYRKGSYRCIVSEWIAEQONWQEIQEK 
A\ro\/ATWTnPTVT RAAVPKNVSVAEGKBLD 
LTOirrTDRADDVRPEVTWSFSRMPDSTLPGS 
RVLARLDRDFLVHSSPHVALSHVDARSYHLL 
VTIDVSKENSGYYY 


217 


1567 


A 


2480 


2 


460 


' -CmCEGPQRFEBYKYLOyKAGLYEAIADHY 
\Acs\n vrOHECVRELATRPORLSPIENFLPLHY 
DYLQFAYYRVGEYVKALECAKAYLLCHPDD 
EDVLDNVDYYESLLDDSIDPASIEAREDLTMF 
VKRHKLESELKSAAEGLGXSYTEPNYW 


218 


1568 


A 


" 2483 


140 


383 


■ "AFSSPHPSPAPQFPECGFYULYDKILLFKHUKl 
SANLLQLVRSSODIQEGDLVEWLSASATFED 


219 


"1569 


" A 


" 2489 


3 


428 


CEEONCSGGALNHFRSRQPIYMSLAGWTCRD 
DCKYECMWVTVGLYLQEQHKVPQFHGKWP 
FSRFLFFQEPASAVASFLNGLASL VMIA^K y k i 
p^rp A q<:pmyhTCVAFAWVS 


220 


1570 


A 


^ 2498 
_ - 


1 


"1297 


- "MDOEAVRbCl DNQCVSLHPQKVDS V ANdAPA 
APKIPRLVQATPAFMAVTLVPSLVTLFWDH 
HHFGREAEMRELIQTFKGHMENSSAWWEIQ 
MLKCRVDNVNSQLQVLGDHLGNTNADIQMV 
KGVLKDATTLSLQTQMLRSSLEGTNAEIQRL 
KEDI^KADALTFQTLNFLKSSLENTSIELHVL 
SRGLENANSEIQMLNASLETANTQAQLANSS 
UQ4ANAEIYVlilGHU)SV>a>UlTQNQVlJ^ 
l^GANAEIQGLKENWNTNALNSQTOAFIKSS 
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wo 01/57188 



PCTAJSOl/03800 



SEQID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

09/496 
914 


Predicted 

beginniDg 

nucleotide 

locsftion 

correspondi 

Dgtofiist 

amino acid 

residue of 

peptide 


Predicted end 

nucleotide 

location 

cnrresnnndinfi 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A«Alaninc OCysteinc, 
D=Aspartic Acid, E=Glutaraic Acid, 
F^Phenylalanine, G=Glycine, H=Histidlne, 
Issisoleuicine^, K=Lysine, L^=Leucine> 
M=Methionine, N=Asparagine, F^Prolioe^ 
Q=Glutaminei, R^Arginine, S=^erine» 
T«Thrconinc, V=Valinc, W-Tiyptophan, 
Y«Tyrosinc, X=Unknown. *»Stop codon, 
^=possible nucleotide deletion, \=possible 
nucleotide insotion 














ia)NTSAEIQFLRGHLERAGD£IHVLKRDLKM 
VTAQTQKANGRLDQTDTQIQVFKSEMENVN 
UNAQIQVLNOHMKNASREIQTLKQGMJCNA 
SALTSQTQMLDSNLQKASAEIQRLRGDLENT 


221 


1571 


A 


2501 


3 


500 


RmNNDGLSPLMMAAKTGKIGIFQHnRREV 
TDEDTRHLSRKFKDWAYGPVYSSLYDLSSLD 
TCOEEASVLEILVYNSKIENRHEMLAVEPINB 
IXRDKWRKFGAVSFYINVVSYLCAMVIFTLT 
AY xl^rJJ&ijlrr ir livl 1 vUilJsLJVjay iiLtr i 
GVLFFFTN 


222 


1572 


A 


2508 


3 


395 


DAHCQRKLAMQEFMEINERLmHTQKQKL 
ARHVRDKEEEVDLVMQKVESLRQELRRTER 
AK3CEl^VHTEAIA\EASKI)RKLREQSEHySK 
QL£N£LDuL]UjKyib I orU Vi^olcnl^cl i Jsx 
KTDLEKKS 


223 


1573 


A 


2544 


2 


412 


NDPAIISNFSAAVVHTIVNHIXESMTSLEVTK 

MVDERTDYLTKSLKEKTPPFSHCDQAVLQCS 

EASSNKDMFADRLSKSIIKHSH>KSKSVIPNID 

ICNAVYKESLPVSGEESQLTPEKSPKFPDSQNQ 

LTHCSLSAA 


224 . 


1574 


A 


2552 


401 


1 


OASLCFISTAFrVLTFLIDSCRFSVPElRJ>nELSIii 

CYNIYSIAYlVRLTVGRERIS<^rEEAAEPVl/l 

QEGLKNTGCAnFLLMYFFGMASSIWWVILTL 

TWFLAAOLKWGHEAIEMHSSYFHIAAWAIPA 

VK 


225 


1575 


A 


2563 


724 


1 


MSAiUOBRREKGEEEGEGEKDGDEDEKEEEKb 

VARICKNDMGGSQRVLEKHWTSFLKARLNC 
SVPGDSFFYFDVLQSITDHQINGIPTVVGVFTT 
QLNSIPGSAVCAFSMDDIEKVFKGRFKEQKTP 
DSVWTAVPEDKVPKPRPGCCAKHGLAEAYK 
TSIDFPDETLSFIKSHPLMDSAVPPIADEPWFT 
KTRVRYRLTAISVDHSAGPYH 


226 


1576 


A 


2571 


449 


3 


EGVLFVYGNYVGDVMNFEMAAEMAQEVAIP 

TRTVLTTDDISSSPIEDRDORROVAGNFFIFKV 

AGAACDRGMSLEACEAVTRKANRRTYTMG 

VALEPCSLPQTRRHNFEIGAEEMEIGMGIHGE 

RGVIREKMMPADAIVDHIMDRIFS 


227 


1577 


A 


2575 


3 


,1197 


VLSDLCLFYYRDEKEEGILGSILLPSFQIALLTS 

ELWMKAMLDAALVQlEPVKRVDKrrSENAP 

TKETNNIPNHRVUKPEIQNNQKNKEMSKI^^ 

KKALEAEKYGFQKDGQDRPLTKINSVKLNSL 

P^PVP^OSA(T>ACYlVHYRPINLSSSENKIVNVS 

LADLRGGNRPNTGPLYTEADRVIQRTNSMQQ 

LEQWIKIQKOROHEEETRGVISYQTLPRNMPS 

HRAQIMARYPEGYRTLPRNSKTRPESICSVTP 

STHDKTLGPGAEEKRRSMRDDTMWQLYEW 

QQRQFYNKQSTLPRHSTLSSPKTMVNISDQT 

MHSIPTSPSHGSIAAYQGYSPQRTYRSEVSSPI 

QRGDVTlDRRHRAflHPKVK 


228 


1578 


A 


2583 


3 


330 


LPFLGLGSYLPQGMVMASPEMNPTICSVFEA 
HmXFHATTFRRGFQVTVLVGNVRQTAVVE 
KIHAKVRGTWPFISPEVRKEGGLPQTGRELLD 
PTMGKPHLWWVAA 


229 


1579 


A 


2589 


1 


448 


"DDKNAQGDCRHVKPTSGNAFnCKYPCGKSR 
ECVAPNICKCKPGYIGSNCQTALCDPDCKNH 
GKCDCPNICOaPGHGGATCDEEHCNPPCQH 
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PCTAJSOl/03800 



SEQID i 
NO: of 1 
nucl- ] 
eotide ! 
seq- 
ueaoe 


SEQID I 
^0:of I 
peptide 
seq- 
uence 


Viet' { 
\od ] 


5EQ 

[DNO: 

n 

USSN 
09/496 
914 


r»rcdicted 

beginning 

nucleotide 

[ocation 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 

lact' amifin 
lO labl illlllllU 

acid residue 
of peptide 
sequence 


Ammn flciH qeouence ^A«Alanine C^ysteine, 
D^Aspartic Acid, EF<jIutamic Acid, 
F=Phenylalanine, G=Glycinc, H=Histidine, 
I=Isoleucine, K=Lysine, L=Leucine, 
M=Metliionine, N=Asparagine, P^Proline, 
Q=<jlutamine, R«Arginine, S=Serine, 
T=»Threonine, V=Varme, W=Tryptophan. 
Y«Tyrosine, X=Unknown, ♦«=Stop codon, 
/"^possible nucleotide deletion, ^r=possiblc 
nucleotide insertion 














"GGTCLAGNLCTCPYGFVGPRCETMVCNRHC 
ENGGOCLTPDICQCKPGWYGPTCSTA 


230 


1580 


A 


2593 


2 


138 


A VTP<? WF A YV ADITOEHERSMAYOLVCMFI 
LYLLYLLRNAFFLR 


231 


1S81 


A 


2595 


18S 


2 


cmJvrntTPWPTFFOKLLEOALKTYPVNPPER 
WEKIAEAVPGRTKKACIKRYKVADLRISK 


232 


1582 


A 


2596 




391 


STVTGQPRJU.LDTAGHQQPFLELKIRANEPOA 
GRARKRTPTCEPATPLCCRRDHYVNFQELGW 
RDWILLPEGYQLNYCSGQCPTHLAGSPGIAAS 
FHSAVFSLLKANNPWPGRTSWCVPTARRPLS 
LLYL 


233 


1583 


A 


2601 


184 


403 


LLFSDEIIMAAPLRIADVTSGLIGGEDGRVYV 

YNGKEmGDMTOKCKSWITPCPEEKVNVLQ 

NSIPYWERIT 


234 


1584 


A 


2614 


178 


335 


PLTLCXPENNIOPPQADA VPDKliU iLjryuoi xi* 
DGSKSSDDQKUSYLWEKTO 


235 


1585 


A 


2616 


2 


896 


DVLEVYGTGVASTRHEMGTLDKHKELlil^LV 

AKFLNVEAAMVFGMGFATNSMNIPALVOKG 

CLILRDEVNHTSLVLGARLLGATIGIFKHNYA 

QSLEKLLRDAVIYGQPRTRRAWKKILILVbUV 

YSMEGSIVHLPQIIALKKKYKAYLYIDEAHSI 

GAVGPTGRGVTEFFGLDPHEVDVLMGTFTKS 

FGASGGYIAGRKARILSPPACLVPNTGSHSLH 

RLTRDLQMNEAMVALVTDRLQGWNSGEGN 

WDRADKFGDL vDYLK.VHoJno/\V r/iooiviorri 

AEOIIRSLKLIMGLDGTTQ 


236 


1586 


A 


2621 


1 


392 


NTSSFPAQPSSPARPSLPHLSQHPSWLLl^LAii 
ADHPQCGRFLPLHEPEPLCPSPSLSYPTLVSS 
WSSPFSSHHGCPPGLYPFPTSPKTIQPPGLAQL 
KMLCIPPGRQQLRGAQSMPGHGALSPLLLPP 

A 


237 


1587 


A 


2628 


398 


1 


DLVCKISGFGRGPRDRSEAVYTTMSGRSil'AL 
WAAPETLQFGHFSSASDVWSFGIIMWEVMAF 
GERPYWDMSGQDVIKAVEDGFRLPPPRNCPN 
LMHRLMLDCWQKDPGERPRFSQIHSILSKMV 

ODPEPPNV 


238 


1588 


A 


2631 


I 


1104 


WSPCSLTCGVuLi^lKXJvruoni-ii-oiuiivuNE»i v 

IIJU)ELCRQPKPSTVQACNRFNCPPAWYPAQ 

WQPCSRTCGGGVQKREVLCKQRMADGSFLE 

LPETFCSASKPACQQACKKDDCPSEWLLSDW 

TECSTSCGEQTQTRSAICRKMLKTGLSTWNS 

TLCPPLPFSSSIRPCMLATCARPGRPSTKHSPHI 

AAARKVYIQTRRQRKLHFVGGGFAYLLPKTA 

WLRCPARRVRKPLITWEKDGQHLISSTHVT 

\rATivn\n VTUDi VPQnAfJVYTCSAGPAREHF 

VIKUGGNRKLVARPLSPRSEEEVLAGRKGGP 

KEALQTHKHQNOIFSNOSKAEKROLAANPOS 

RYDDLVSRLLEQGAPCSSSKKKN 


239 


1589 


A 


2636 


I 


678 


' MKPDNE.LDEHGHVHrrDFNIAAMLPiaili^ll 
TMAGTKPYMAPEMFSSRKGAGYSFAVDWW 
SLGVTAYELLRGRRPYHIRSSTSSKEIVHrFET 
TWTYPSAWSQEMVSLLKKLLEPNPDQRFSQ 
LSDVQNFPYMNDINWDAVFQKRLIPGFIPNK 
GRLNCX)PTFEL£EMILESmHKJKKKRLAKK 
EKDMRKCDSSQTCLLQEHLDSVQKEFniNRE 

KVNRDCI 


240 


1590 


A 


2639 


389 


3 


- ■ ELLDPTTPMRTKCIELLYAALTSSSTDQFKAU 
LWQNFAREIEEHVFTLYSKNIKKYKTCIRSKV 
ANLKNPRNSHLQQNLLSGTTSPREFAEMTVM 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide ' 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino aciQ sequence ^/v^njaiiiuc v^^v/swuiyi 
D=Aspartic Acid, &»GIutainic Acid, 
F=^eDylalanine, G=Glycine, H=Histidine. 
I=Isoleuciiie, K=Lysine, D=Leucinc, 
M=Methionine, N=Asparagine, P=Proline, 
i^^vjjuiamme, n-vvrginuie, ocrmci 
T-Threonine, V«Valine, W-Tiyptophan, 
Y^Tyrosine, X^Unknown, ♦^Stop codon, 
A^possible nucleotide deletion. V=possible 
nucleotide insertion 














£MANK£Llv(^LKAbx IbiW^cnii-ry vu^vji** 
Y 


241 


1591 


A 


2640 


392 


3 


IRLmRCVFMRLATICVLV>TLQSKITSCDDD 
TCDLCGYNQKLYPCWETQVGQEMYKLMIFD 
FIIILAVTIJWFPRKLLVTYCSSCKLIQCWGQ 
QEF AIPDNVLGIVYOQTIC WIU AFf brLLTAM 
Y 


242 


1592 


A 


2642 


405 


I 


YFKJmXLLVGVICVAAAVEKWNLHKRIALR 
MVLMAGAKPGMLLLCmCCTTLLSMWLSNT 
STTAMVMPrVEAVLQELVSAEDEQLVAGNSN 
TEEAEPISLDVKNSQPSVELIFVNEDILDFLMK 
SPLMISQAa 


243 


1593 


A 


2646 


412 


2 


CLAMIKGIQSSGKHYFSSLFPYWLICFLIRAF 

LLNGSroGIRHMFTPKLEIMLEPKVWREAATQ 

VFFALGLGFGGVIAFSSYNKRDNNCHFDAVL 

VSFINFFTSVLATLVVFAVLGFKANVINEKCIT 

ONSETV 


244 


1594 


A 


2650 


1 


1271 


MTTTLIGLLKTARLLRLVRVARKLDRYSEYG 

AAVLMLLMCIFALIAHWLACIWYAIGNVERP 

YLTDKIGWLDSLGQQIGKRYNDSDSSSGPSIK 

DKYVTALYFTFSSLTSVGFGNVSPNTNSEKIF 

SICVMLIGSLMYASIFGNVSAIIQRLYSGTARY 

HMQMLRVKEFIRFHQIPNPLRQRLEEYFQHA 

WTYTNGIDMNMVTNGTCSSCTSDDGHFILVS 

NHHQGGLIYSWNDAASMQRPFNHIKSSLLGS 

TSDSNLNKYSTmKn>QLTLNFSEVKTEKKNSS 

PPSSDKTOAPKVKDRTHNVTEKVTQVLSLGA 

DVLPEYKLQAPRINKFnLHYSPFKAVWDWLI 

LLLVIYTAIFTPYSAAFLLNDREEQKRRECGY 

SCSPLNVVDLIVDlMFUDILmFRTTYVNQNEE 

WSDPASV 


24S 


1595 


A 


2656 


385 


2 


NLTWWPLFRDVSFYIVDLIMLIlfl*LDNVIMW 

WESLLLLTAYFCYWFMKFNVQVEKWVKQ 

MINRNKWKVTAPEAQAKPSAARDKDEPTLP 

AKPRLQRGGSSASLHNSLMRNSIFQNKIHTLD 

PHV 


246 


1596 


A 


2660 


200 


506 


VLVLQMNYYQMLHYYVLhW^VNtsrLArcuri 
LLDMRDCHLIKTNQLSQATALAKLCSDHPEIG 
IKGSFKQTYLVCLCrSSPNGKLIEEVSMFSFlS 

NYFLS 


247 


1597 


A 


2678 


3 


267 


DAWVKNDIIFNQTERKQKISENLKHLASVKV 
VQKNLVFVvGLbQKljAUrE.vorbvrr viLiirr 
VSLSYLEIIFDPAQLCDSSEHnS 


248 


1598 


A 


2687 


1 


404 


DFTTLAAMMRTLFSLFGDVRSDVHRFSVIU' 

GAAIKSVKNPDKKSIENQVLDSLVPLLLYSQD 

ENDAVAEESRQVLTICAQFLKWKLPREVYSK 

DPWHIKPTEAOTICRFFEKKCKGKINILEQTL 

MYSKNPKL 


249 


1599 


A 


2692 


1 


440 


FRRRRRRRERDCAAQGARRHCRHLAECKLV 

SFPIGmCVLRNVSGQIHLITLANNELKSLTSK 

FMTTFSQLRELHLEGNFLHRLPSEVSALQHLK 

AIDLSRNQFQDFPEQLTALPALETINLEENEIV 

DVPVEKLAAMPALRSINL 


250 
251 


1600 
1601 


A 
A 


2693 

2694 


459 

2 


21 
404 


LLPGSLGVPILHSQPWDPSPQCPHRAPSTPKRL 
PPLGALSQALTFLSRAAKNHSQDPGKGTKPFP 
AAPAAPPPRSSLPAPLPMGLKDKGPQPAPPTIF 
NSPWHPATLPGALGPQLSQAAPSPIPPPCLMO 

ISSCPDLKLTXSSTP 
■ FVFDLKLRVPGFAALLIHGASSVPGPHl'VRLR 
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PCT/USOl/03800 



SEQID b 
NO: of I 
nucl- { 
eotide s 
seq- I 
uence 


EQID 1 ^ 
40: of h 
icptide 
icq- 
lence 


Ati i 
lod I 
i 
\ 
( 


;eq I 

DNO: I 
n 1 
JSSN 1 
)9/496 i 
314 1 


>redictcd 1 
)egmning i 
ludeotide 1 
ocation 
Mrrespondi 
[ig to first 
Bmmo acid 
residue of 
peptide 
sequence 


'redicted end i 
luclcotidc 
ocation 
conesponding 
to last amino 
acid residue 
of peptide 
sequence 


^mino acid sequence (A=Alamne OC^ine, 
>Aspartic Acid, EFGlutamic Acid, 
7— pVienvlalanineL O^Olvcine. H'Histidinek 
I=Isoleucine, K=Lysine, Iy=Leucine, 
M=Methiomne, N=Asparagine, P=Proline, 
Q=Glutanune, R=ArgUiine, S=S6rine, 
T^Threonine, V==Valine, W-Tryptophan, 
VesTvmwne Xs=Unknown. **=StOD codon. 
/=possible nucleotide deletion, V=possible 
nucleotide insertion 














QKRKKKAPDHSSGKlUiELVTrHTVDKLb-l 
mrrpVT rn\ qOFT LHSLLLPRRKTEKRALGSH 
RKAGFPEHPVAPEPLSNSOQISKEGREQVLSEI 

UAUJUvJ^ 


252 


1602 


A 


2697 


421 


1 


PQKSHSGAYQCFATRKAgTAQDFAIlALiilXi 

TPRIVSSFSEKWNPGEQFSLMCAAKGAPPPT 

VTWAIX)DEPrVRIX3SHRTNQYTMSDGTTISH 

MNVTGPQIRDGGVYRCTARNLVGSAEYQARl 

NVRGPPSIRAMRNrr 


253 


1603 


A 


2698 


65 


401 


ACCQWRRTLIPAKSTTVSCriSlPHHPFRGSYS 
FDDHTTDSEALSRSSHVFTSHPRMLKRQPAIEL 
PLGGEYSSDVPRPLSTQLSSSLLGYFSTLMTG 

AAFTNNIASSTIIL 


254 


1604 


A 


2699 


438 


301 


GQIHSQDDPPFIDQLGFOVAPOFQTFVACgiig 
RVRGPWEAGPGVGY 


255 


1605 


A 


2700 


1 


842 


LQNREDSSEGKKKLVEAEELEEKHRJiAgvs 

AQHLEVHLKQKEQHYEEKIKVLDNQIKKDLA 

DKETLENMMQRHEEEAHEKolULDiiV^NAmiri 

AMDSKIRSLEQRIVELSEANKLAANSSLFTQR 

NMKAQEEMISELRQQKFYLETQAGKLEAQN 

RKLEEQLEKISHQDHSDKNRLLELETRLREVS 

LEHEEQKLELKRQLTELQLSLQERESQLTALQ 

AARAALESQLRQAKTELlib 1 1 AHAtitsoi 1 

VGl .GSNIFRLLKAS ARMSVELALSDLAHP 


256 


1606 


A 


2701 


2 


405 


FVGGPGADPPVAVMWDPRAARMDLI AT Aii 
LLKESGNQVLKNGNFSLAIRKYDEAIQILLQL 
YQWGVPPRDLAVLLCNKSNAFFSLGKWNEA 
FVAAKECLQWDPTYVKGYYRAGYSLLRLHQ 

PYEAARMFFEGLR 


257 


1607 


A 


2702 


2 


399 


FVESASSRPPGCFSGDGRFWLVSEGSRRC3WD 
FNPSFSFLDPRYSVGGDENIGTVTTLANILREF 
NPSLKGFSVGTGKETSPNAFLNQAVAGGRAE 
DLPVQARRLVDLNKNDTOIHFQEDWKIITLFI 

GGNDL 


258 


1^8 


A 


2709 


I 


1097 


S VGARQGEAKUKlRRrr fJvul/irfC v ijVi«.v ^ 

MTRKELLTVYSSEDGSEEFEnVLKALVKACG 

SSEASAYLDELRLAVAWNRVDIAQSELFRGDI 

QWRSFHLEASLMDALLNDRPEFVRLLISHGLS 

LGHFLTPMRLAQLYSAAPSNSLIRNLLDQASH 

0 A r>Tv A 15 A I vrwiA API "RPPDVGHVLRMLLG 

KMCAPRYPSGGAWDPHPGQGFGESMYLLSD 

KATSPLSLDAGLGQAPWSDLLLWALLLNRA 

QMAMYFWEMGSNAVSSALGACLLLRVMAR 

LEPDAEEAARRKDLAFKFEGMGVDLFGECYR 

SSEVRAARLLLRRCPLWGDATCLQLAMQAD 

A p A T?P AnnGVOSLPTOKWWGDMARR 


259 


1609 


A 


2721 


1 


403 


' "VYLGAGPGLFFSNEGAKEOHKANIPKLMLrR 
GGFSQREMVTGERSPSPEEEEEEEEEGFGERA 
SCRRGlJFRVRLTRVGLAAPSKASRGQfctJl^AA 
PKSPVRmCSPKFRFPRVSLSPKARSGSGDQEE 

GGLRVRLP 


260 


1610 


A 


2728 


I 


477 


" LLOGDLRYHLQQNVHFTEGTVKLYICKLALA 
LEYLQRYHOHRDIKPDNILLDEHGHVHITDFN 
lATVVKGAERASSMAGTKPYMAPEVFQVYM 
DRGPGYSYPVDWWSLGITAYELLRGWRPYEI 
HSVTPIDEILNMFKVERVHYSSTWCKGMVAL 

1 . 


261 


1611 


A 


2730 


3 


" 547 


" "LTITDFILVLYRYYRSPLVQIYEIEQHKIETWR 
EIYLQGOFmVSISPNDSI^VYTUKNRIH 
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PCT/USOl/03800 



SEQID J 
NO: of 1 
jiucl- J 
eotide i 
seq- 1 
uence 


SEQID I 
^0: of 1 
xptide 
seq- 
uence 


Viet ! 
lod 


SEQ 

DDNO: 

n 

assN 

09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspond! 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 

corresponding 
to last amino 
acid residue 
of peptide 


Amino acid sequence (A»Alanitte 0=Cysteine. 
I>Aspartic Acid, EKJlutamic Acid, 
FHPheaylalanine^ G^Glycine^ H»Histidine^ 
I=Isoleudnc, K^Lysine, L=Lcucinc, 
M=Methionine, N=Asparaginc, P^Proline, 
Q=Glutamine, R=Aiginine, S=^erine, 
T^Threonine, V«Valinc, W«Tiyptophan, 
Y^Tyrosinc, X=Unknown, •«=Stop codon, 
/^possible nucleotide deletion. V^possible 
nucleotide insertion 














RLPVLDPVSGmXHILTHKRLLKFLHIFGSLU' 
RPSFLYRTIQDLGIGTFRDLAWLETAPILTAL 
DIFVDRRVSAlAVVNECCnHPQDERLGLGW 
. GLGEPOSEERLFPAATTSR 


262 


1612 


A 


2733 


3 


A'3 1 

431 


GPEFPGSAKLVFLDLSYNNLTQLGAGAFRSA 

GRLVKLSLANNNLVGVHEDAFETLESLQVLE 

LNDNNLRSLSVAALAALPALRSLRLDGNPWL 

CDCDFAHLFSWIQENASKLPKGLDEIQCSLPM 

ESRRISLRACRRPASRV 


263 


1613 


A 


2736 


2 


343 


D AT? Tcr.vnppVRK ATKGGENCSFEDNKNWQF 
LWGLNGNFNFFKEPWGGRNNHAKGFRTTW 
ARSSSQNNRTFQNNRNFLRLQRDSQKKGQFA 
DT TQPr VMT PO55PGGLEF0Y0AT 


264 


1614 


A 


2738 


2 


245 


RAMLKCLREGQPPPSYNWTRLDGPLPSGVRV 
DGDTLGFPPLTTEHSGIYVRHDTNEFSSRDSH 
nT\/n\n nPPPri'JKrKOVDL 


265 


1615 


A 


2752 


2 


388 


AAGDAPLRSLEQANRTRFPFFSDVKGDHRLY 
LAAVETTVLVLIFAVSLLGNVCALVLVARRR 
RRGATACLVLNLFCADLLFISAIPLVLAVRWT 
EAWLLGPVACHLLFYVMTLSGSVTILTLAAV 


266 


1616 


A 


2755 


192 


1. 


"AFREVGGYWGLLCEHLYAiPSKTSEONWrAK. 
UJGYLPLQDAFHIFQDPLTGDLPWPELDLGLP 


267 


1617 


A 


2760 


434 


714 


ASRLEKQNSTPESDYDNTPNDMEPDGMG.YM 
HRTSVPGEGLPRARDLAGLGQQKQFTTHTPF 
LYFOTHKGLKDSSIRSEVTCLGISQCWRKGFF 


268 


1618 


A 


2762 


1 


405 


lACTFCGQDEWSPERSTRCFRRRSRFLAWOBf 
AVLLLLLLLSLALGLVLAALGLFVHHRDSPL 
\/nAQr.r.Pi APFfiT VCLGLVCLSVLLFPGQPSP 
ARCLAQQPLSHLPLTX3CLS1LFLQAAEIFVESE 

LPLSWAE 


269 


1619 


A 


2772 


3 


243 


TRPAEKIQYLVLiTVMSHPSQAYDKLSLbUJlL 
LIAVLNLLRREVSEHGRHLQQYFNLFVMYAN 
LSia^SFSEFCFDVSY 


270 


1620 


A 


2789 


1 


486 


VEQIAKAEETHSSLSQELQARLQTVIREKEEL 
LQLSIERGKVLQNKQAEICQLEEKLEIANEDR 
KHALERFEQEAVAVDSNLRVRELQRKVDOIQ 
KAYDELRLQSEAFKKHSLDLLSKERELNGKL 

RHLSP 


271 


1621 


A 


2795 


1 


568 


" KEKJlVTVOLPTESIQKNQEDKLKMVJk'Kifi.^^Kti 
FSGSDRGKLPGSEEKNQGPSMIGRKEERLITE 
RKHEHLKNKSAPKWKQKVIDAHLDSQTQN 
FQQTQIQTAESKAEHKKLPQPYNSLQEEKCLE 
VKGIQEKQVFSNTKDSKQEITQNKSFFSSVKE 
SORDDGKGALNIVEFLRKREELHOILSTVKQP 


272 


1622 


A 


2797 


8 


523 


" KCMQGKYAGAMESEPCVCrKADFDCUYCiYb 
RHSNGQCLPAFwFNPSSLSKDCt>Lov^o ILIND i 
GYRKWSNNCTDGVREQYTAKPQKCPOKAP 
RGLRIVTADGKLTAEQGHNVTLMVQLEEGD 
VQRTLIQVDFGDGIAVSYVNLSSMEDGIXHV 
YONXGIXRXTVQVDNSLG'S 


273 


1623 


A 


2801 


" 72 


' 395 


HPSRSNVGPRQLTVWNTSNLSHDNRRKYU-JS 

DEEGQNQLGIRIHQDJPLFPRRRELPALRTTNG 

KADSLWSRNSVMQELSELEKQIQVIRQELQL 

AVSRKTELEEYH 


274 


1624 


. A 


2805 


168 


320 


" ■ ILWLYFETGTWVYPVFAkLSLLGLAALf ^iJUKE 
1 IFIARNGWGETLTHCKRV 
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wo 01/57188 



PCTAJSOl/03800 



SEQID 
NO: of 
nucl- 
eotide 
seq. 
uenoe 


SEQID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


CCA 

ID NO: 
in 

ttCCM 
UodIN 

09/496 
914 


beginning 
nucleotide 

correspondi 
ng to first 
amino acid 
residue of 
pcpiiuc 
sequence 


Pmlicf^fl end 

nucleotide 
location 

cnrrftcnondinC 

to last amino 
add residue 
of peptide 
sequence 


Amino acid sequence (A«Alanine OCysteine, 
D^Aspartic Acid, EKihitamic Acid, 
F^Phenylalanine, GKilycine, H°Histidine, 
I-Isoleudne, K^Lysine^ IMjeucine, 
M=Melhioninc, N=Asparaginc, P=ProIine, 
Q==Gh]tamine, R'^Arghune, S^'Serine, 
T-Threonine, V-Valinc, W«Tiyptophan, 
Y«TyrosinCi X^Unknown, ♦=Stop codon. 
/=possible nucleotide deletion, ^Fpossible 
nucleotide insertion 


275 


1625 


A 


2812 


208 


321 


GSLATCQLSEPLLWFILRVLDTSDALKAFHD 
MGKIIFQ 


276 


1626 


A 


2813 


41 . 


266 


AORSLHGAGDRAWVGISPTDWSPKWELCK 
KYQQQTWAIDLAGDETIPGSSLLPGHVQAY 
QVGPVRRNGEAGPG 


277 


1627 


A 


2817 


3 


410 


LFISYLHTPKHKQHEVLQAMGSILGrrGEEME 
PLFQEBHGTATRWMTGWLEGGSKSVPKTPL 

LPPHNSPGKIK 


278 


1628 


A 


2821 


238 


457 


GLSGPSCSCPHSPLPTUSRAQLETALKWRNYE 
VKLRLLLHLEELQMEHDIRHYDLESVPMTWD 

r V L/l^IS r JKJL* V 


279 


1629 


A 


2822 


342 


1 


PLIPANLPAHSNPLQPLPSLPHFFLPA'J'HX>Fi 
TPPTFSSVPPPLPSLSSE^HHSPLHSEUOTBLQS 
CRLPSRPSVSRELPPQSGPASSVPLAPTPLPDS 
VFSQRHr 1 Arl'r Ad 


280 


1630 


A 


2825 


307 


77 


PSMVWSYHWGVKQKRLALCVFSFEEGGRRK 
CGQYWPLEKDSRIRFGFLTVTNLTGAVGEPG 
VAFQCDGQRRREPTC 


281 


1631 


A 


2827 


81 


381 


KMGTA V W Vr JUi*UiKJUJiw\oV^JivjUU V jlvt/us. 
QDCTPSLKSLVATGNLLDLEETAKAPLSTVSA 
NTTOMDEVPRPQALSGSSWWVSGCVASRS 
VILSLTSG 


282 


1632 


A 


2830 


471 


160 


klJXDKYELEPSPLTQYlLERKSPHTCWQYFV 

YNYPVLLPLLDDLFKVHKLKPNLKWRQAFDS 
YLKTLPPYYL 


Zoo 






2835 


462 


148 


VSPALSLTPTIFSYSPSPGLSPFTSSSCF5FNPEB 
MKHYLHSQACSVFNYHLSPRTFPRYPGLMVP 
PLQCQMHPEESTQFSrKLQPPPVGRKNRERVE 

SSEESAP 


284 


1634 


A 


2836 


2 


384 


■ KTLPRTLLDILADGTILKVGVGCSEDASKLLQ 
DYGLWRGCLDLRYLAMRQRNNLLCNGLSL 
ir^T AFTVT NFPLDKSLLLRCSNWDAETLTED 
QVJYAARDAQISVALFLHLLGYPFSRNSPGEK 
KR 


285 


1635 


A 


2843 


20 


271 

278 


PmPYYSYSGLDRDCSWLPLAKAWLPDVMlL 
VCDRVSEDGINRQQAQEWCIKHOFELVELSP 
FFT PFEDGKCLCVRRKYGTYI 
TAEDVLTVAYEHGVNLFDTAEVYAAGK 


286 
287 


1636 
1637 


A 
A 


2845 
2851 


197 
2 


427 


FVAEVRREWAKYMEVHEKASFTNSELHRAM 
NLHVGNLRLLSGPLDQVRAALPTPALSPKDK 
AVLQNLKRILAKVQEMRDQRVSLEQQLRELI 
QKDDITGSLVTTDHSQMKKLFEEQLKKYDQL 
KVYLEQNLAAQDRVLCALT 


288 


1638 


A 


2859 


2 


"469 


FVNLGILTCIECSGIHREMGAHISR1Q5LELDK 

LGTSELLPAKNVGNNSFNDIMEANLPSPSPKP 

TPSSDMTVRKEYITAKYVDHRFSRKTCSTSSA 

KLNELLEAIKSRDLLAUQVYAEGVELMEPLL 

EPGOELAETALHLAVRTADQTSLHLVE 


289 


1639 


A 


2861 


2 


454 


"FVASGGPATARMSDSQFFCVAEERSGHCAW 
DGNFLYVWGGYVSIEDNEVYLPNDEIWTYDI 
DSGLWRMHLMEGELPASMSGSCGACINGKL 
YIFGGYDDKGYSNRLYFVNLRTRDETYIWEK 
ITDFEGQPPTPRDKLSCWVYKDRIJYFG 


290 


1640 


A 


2868 


1 


378 


FRQGQLYKVFLHGSQGQVYHSQQVGPPGSAl 
SPDLLLDSSGSHLYVLTAHQVDRIPVAACPQF 
PDCASCLQAODPLOGWCVLOGRCTRKGQCG 



161 



wo 01/57188 



PCTAJSOl/03800 



SEQID 1 ! 
NO:of 1 
nucl- I 
eotide 
seq- . 
uenoe 


SEQID 1 
HO: of 1 
[}eptide 
seq- 
uence 


Viet \ 
iod 1 

1 


SEQ ] 

[DNO: 

in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleQtidc 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid seouence (A^Alanine CfCysteine, 
I>Aspartic Acid, B^Ohitamic Acid, 
F=Phenylalanine, G=Glycuie, H==Ifistidine, 
Msoleucine, K=Lystae, LHLeudne, 
M=Methionine, N«Asparaginc, P^ProIinc, 
n=Glutflmine- R— Arelninc, S=Swine, 
T-Threoninc, V-Valinc, W-Tiyptophan. 
Y=Tvixisine. X^Unknovra, *=Stop codoo, 
/=possible nucleotide delctioa V=possible 
nucleotide insertion 














ragqlnqwlwsVbedshclhiqsllpghhpr 

OE 


291 


1641 


A 


2870 


1 


385 


■ppYi\^TsnsIRO0LLRKRHIGNDIVTlVFQEPGAL 
PFTPKSIRSHFQHVFVIVKVHNPCTENVCYSV 
nvcw «iKnvpPFnPPIPKGVTFPKSAVFRDFLL 
AKVINAENAAHKSEKFRAX4ATRTRQEYLK0 

LA 


292 


1642 


A 


2877 


3 


188 


RPTRPPPATTQSPESTMDTSLKKEKSAILULYI 
DDDD A VPVQPR YV A VHrHGMLVSCW CHL 


293 


1643 


A 


2878 


1 


427 


REKEEEVEHKEDKWKETEKEAEQlilUiJtiUiiL 

r> A nTUunA AlPQaPRTrn^nSRSVLDLVNYF 
GAG T HPL'AAlr owiitv* v^vJ^E'J*^'^ * ▼ * * 

LSPEKLTAENRYYCESCASLQDAEKWELSQ 
GPCYULTLLRFSFDLRTMRRRKILDDVSIPLL . 
LRLPLAGGRGOAYDL 


294 


1644 


A ] 


2879 


109 


245 


QLCCFCTRQTTLIVYILSnGMVIKrKlLULR V 1 
IIVFvTGGVLG 


295 


1645 


A 


2880 


3 


320 


Tassqhgilt^sllfsicktcirtmdhhcpka 
fncflpgaekstidapfslqpflqdskyntals 

LSESISO 


296 


1646 


A 


2892 


209 


363 


sqyshsldyhllqvtknpftlgdssnpuym 
rloefsokmdovrghwpvst 


297 


1647 


A 


2893 


8 


424 


SPXTLXLDTFlLLGIQDNILVULArFPFMAGO 
KLYSTMGRFLl\i^KJySrAL.i%iiiYiM.Y v L»i*m'»ii-»n. 

qgdslaaraiavqkgsighllgfledslaat 

qiqqsqasllhmhnppfeptsvdmmpracra 

llalakvddnhsef 


298 


1648 


A 


1 2894 


310 


445 


■ FWIYFPSFFMTGYLPLGFEFAVEITYPESKiUJ b 
SGLLNASAQVNL 


299 


1649 


A 


2898 


1 


492 


" KIKAKNLT^raLCSIFLGTSTLLVWVGVIRy L 
GYFQAYNVLILTMQASLPKVLRFCACAGMIY 
T ry^vmnnxinyn riPVWnif PFNI NTVAECLFSL 
VNGDDMFATFAQIQQKSILVWLFSRLYLYSn 
SLHYMILSLFIALITDSYDTIKKFQQNGFPETD 

LOEF 


300 


1650 


A 


2901 


1 


445 


■ ■ i^nVWNSLNGASEVTFSVHVKDGGSFFKl U51 
T\rrvppvisJK ADFPKVRAKEOTFMFPENQP VS 
SLVTTITGSSLRGEPMSYYIASGNLGNTFQIDQ 
T Tnnv<?TSOPLDFEKI0KYVVWIEARDGGVPP 
FSSYEKLDITVLDVWDNAPIF 


301 


1651 


A 


2902 


162 


433 


' THFin PLGYCFPLLDKDLQLFSGFNCNFDFLE 
EPCGWMYDHAKWLRTTWASSSSPNDRTFPG 
KPAVSEDMKELRPACSTYFNPRFPYKL 


302 


1652 


A 


2909 


2 


412 


" GPQMLCKiOYFIWVTRSQCQFEWLADIMQEV 
FFhmHODLVSVHTm'QLABKFDLRTTMLYl 
CERHFQKVU^LFTGLRSITHFGRPPFEPFFN 
SLQEVHPQVRKJGVFSCGPPGMTKNVEKACQ 
LVNRODRAHFM 


303 


1653 


A 


2914 


291 


453 


KLNRWLCFFYSWSFGIirLYliMV i'LGAFPYFE 
VPPTSILEHLQRRKIMKRPSSCS 


304 


1654 


A 


2926 


179 


354 


' -PGVPSQALRKAESLKKOSVI^AKVKAQTAF 
NKDVOREIADLGEVGAASLPPSSGPGA 


305 


1655 


A 


2938 


135 


438 


- GMGYLHAKGlLHKDLKSKNVFYONGKVVn 
DFGLFSISGVLQAGRREDKLRIQNGWLCHLA 
PEIIRQLSPDTEEDKLPFSKHSDVFALGTIWYE 

LHAREWP 


306 


• 1656 


A 


2944 


2 


329 


- VRWNSCVNCSCAFGNGASLSTSLGESSUCLW 
EIGKWLSCSLLSFPSPLAVUTTFCIVTVLGREA 
LTKGALWAVFLLAGSALLCAEVTGVIWRQPE 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspond] 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
10 jasi anuuQ 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine OCystcine, 
]>A5partic Acid, &=GIutamic Acid» 
F«*henylalanine, G=Glycme, H=Histidinc, 
I«Isoleucine, K=Lysine, L=Leucine, 
M^Methionine, N^^Asparagine, I^Froline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine. V«Valine, W«Tiyptophan, 
Y=Tyrosinc, X=Unknown. *=Stop codon, 
/=possible nucleotide deletion, \r^ossible 

mirlf^HHp incertinn 

SKTKLSFKVSSSA 


307 


1657 


A 


2950 


2 


411 


PDLSTTEGSHAFLFCKARGSPEFNITWDKDGQ 
PVSGAEGKFnQPSGELLVKNLEGQDAGTYT 
CTAENAVGRARRRVHLmVIJVFITLPGDRS 
LRLGDRLWLR 


308 


1658 


A 


2951 


1 


407 


TvrD-DDi>\/T3T:r>>JT7T7riAP<jnPV^T?TT<2V^TfMFRM 
PTRx A K VKp J^r^ijrl^/iJiw vlvi^^ 1 1 o v l^J^iVLClX^iTl 

DSSLPEEEEDEDKEAINGSGNAENRERHSESS 

DWMKTWSYNQTNSSMDFIWYMMRDETLEP 

LPKNVV^MAYTDTGNflYFTOHNTKTTTWLDP 

RLCKKAKAPEDC 


309 


1659 


A 


2954 


2 


179 


QDFLTLTLTEPTGLLYVGAREALFAFSMEALE 
LQG AVRGG AVOuSRACQRAKf KuA v juu 


310 


1660 


A 


2959 


1 


419 


QDMMERAIIDTFVGHDWEPGSYVQMFPYi'iJ 
YTRDDFLFVIEHMMPLCMVISWVYSVAMTIQ 
HIVAEKEHRLKEVMKTMGLNNAVHWVAWn 
TGFVQLSISVTALTAILKYGQVLMHSHWnW 
LFLAVYAVATIMFCF 


311 


1661 


A 


2963 


3 


465 


MKPQMPGLGAPNGYGPGRGRAGVPGGPERR 

PWVPHLLPFSSPGYLGVMKAQKPGAGEGMK 

PQKPGLRGTLKPQKSGHGHENGPWPGPCNA 

RVAPMLLPRLPTPGVPSDKEGGWGLKSQPPS 

AVQNGIQJGHQrPNGyul'uAD™rWOuJ-iir^ 

KI 


312 


1662 


A 


2967 


3 


405 


WLAQEWSPCTVTCGQGLRYRWLCIDHRGM 
HTGGCSPKTKPHIKEECIVPTPCyKPKEKLPV 
EAKLPWFKQAQELEEGAAVSEEPSFIPEAWS 
ACT\nrCGVGTQVKIVRLQ YLLor dV^ d v AUi-ri 
DECEGPKPA 


313 


1663 


A 


2969 


2 


430 • 


WADNCRQGYLDALRFLERRGLTKEPVLWT 
LVSKEPPAPADGNWDAGCDQRRKGGLSLNW 
KVPHVQVKDVPNFEQLSPELEAALKKACTRD 
PSRWARFWHSGPCQVLTYLLLPCTLPFEYIYF 
RSRRLVVWLrl? V r AUL W W M\i 


314 


1664 


A 


2971 


422 


33 


LDXSHNALQRLRPGWLAPLFQLRALHLDHNE 
LDALGRGVFVNASGLRLLDLSSNTLRALGRH 
DLDGLGALEKLLLFNNRLVHLDEHAFHGLRA 
LSHLYLGCNELASFSFDHLHGLSATHLLTLDL 

SoNRM 


315 


1665 


A 


2973 


1 


525 


ITVSTHASGSPFGLEPQSGWLWVRAALDREA 

QELYILKVMAVSGSKAELGQQTGTATVRVSI 

LNQNEHSPRLSEDPTFLAVAENQPPGTSVGRV 

FATDRDSGPNGRLTYSLQQLSEDSKAFRIHPQ 

rrjpvm on nilEOOSSYOLLVOVODGGSPP 

RSTTGTVHVAVLDLNDNT 


316 . 


1666 


A 


2978 


2 


400 


ELWELVSAGKSGPERNTYEVQWTGNVPKA 

GTDANVYLTIYGEEYGDTGERPLKKSDKSNK 

FEQGQTDTFnYAIDLGALTKIRIRHDNTGNR 

AGWFLDRIDITDMNNErrYYFPCQRWLAVEE 

DDGOLSRE 


317 


1667 


A 


2981 


3 


440 


" VLNCQGRPTRPVRINGDGQEVLYLAESDNVR 
LGCPYVLDPDDYGPNGLDIEWMQVNSNPAH 
HRENVFLSYQDKRINHGSLPHLQHRVRFAAS 
DPSQYDASINLMNLQVSDTATYECRVKKTTM 
ATRKVIVTVOARPAVPMCWTEGQ 


318 
319 


1668 
" 1669 


A 
A 


2995 
2999 


119 

2 


414 
332 


LPEKEFPnRKSSSLKVTKCLFTEQPKPmLRFA 
ENYDARLLRIDIANTLREQVQELFNKTYGKQ 
RRTPGEGHVAAVDREVAGFPVPAEGISGETIH 
■ -OTFATTYGRLVVVEDLHSOAgOHWstiHSAEl 
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SEQID 1 S 

nucl- I 
eotide s 
seq- » 
uence 


5EQID I 

)eptidc 

.eq- 

lence 


4et 5 
lod I 

i 
\ 
( 


JEQ 1 
DNO: 1 
n 

JSSN 
}9/496 

m 


Predicted ] 
[)eginning ] 
aucleotide 1 
ocation 
correspondi 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
aucleotde 
ocation 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine OCysteine, 
r\=Aenartir Acid H=^lutamic Acid, 
F^Phenylalanine. G=Glycine, H=Histidine» 
[=Isoleuctne, K=Lysine. l/=Leucine, 
M=Methionine, N^Asparagine, P=Proline, 
Assnh-itumine R=Ar£inine. S=Serine, 
T«Threonine, V=Vaiine, W«Tiyptophan, 
Y^Tyrosine, X«Unknown, ♦'^Stop codon, 
possible nucleotide deletion, V=possible 

IxUdwVlllUC lIlKd Ikwl* 














STLALSHSAQVLASASGRSSTTAHCQIRVWU 
VSGGLCQHUFPHSTTVLALAFSPDDRLLVTL 
rf^nunriPTI AT WGTGHL 


320 


1670 


A 


3000 


693 


322 


IDESminVNYLDYETKISYMMNVSATDQA 
PPFNQGFCSVYITLLNELDEAVQFSNASYEAA 
TTTJxn ATnTCTVPVnAY<?TDNLNOITYRFDAY 
TSTOAKALFKIDArrVRGWGOGAPFFPI 


321 


1671 


A 


3001 


6 


383 


"lUPRGKACXTVLGRSTGELEOFASSRLFPQPC 
ow/r^r^ccm T ^PTTiT DFI MKKDEPPLDFPDTLE 
GFEYAFNEKGQLRHIKTGEPFVFNYREHLHR 


"322 


1672 


A 


3007 


192 


"447 


■ER^TKmFPGRGDSgCACCPSSPVWVFLElW 
nfiAASPRNVGHNIYILAHQLARH 


323 


1673 


A 


3019 


18 


245 


KEIXFYHLIVNNINFFNTRYAKIHIPnASVSEH 
QFltWVSFFFDLHn-VCTFPAGLWFCKNIND 
ERVFGKRGF 


324 


1674 


A 


3020 ' 


523 


797 


LCYFSARYHQRKlruU^ I IT 1 JL.o/ujNKJsx.rx"si-ir A 
YLFIFFEMESHSVTHAGVQRHNLNSLQPLPPG 
FKRFSCLCFLSSWNYRGAPPGPANF 


325 


1675 


A 


3022 


2 


156 


NDFLPLYFGWVLTKKSSEaRKAGQVFLEbL 
ONHKAFKKELROCRWQVGAL 


326 


1676 


A 


3023 


38 


172 


KMVRGSKKLlSFFPOGPYGILAORDPSKGLAl 
FCLNKEALKDEFE 


327 


1677 


A 


3027 


1 


385 


LTLEFLlIJ*AASELAHGKRLACCIVDHJ!vLriiC 
GFYGLYDKILLFKHDPTSANLLQLVRSSGDIQ 
EGDLVEWLSASATFEDFQIRPHALIVHSYRA 
PAFCDHCGEMLFGLVRQGLKCDGCGLNYHK 

RC 


328 


1678 


A 


3030 


13 


569 


1llPTlSCQRPGPQLAAUN4LPYTVNFKVSAKl 
LTGALNAHNKAAVDWGWQGLIAYGCHSLV 
WIDSITAQTLQVLEKHKADWKVKWAREN 

<«i-i*TrkiF/^cin\//^l T>T A C AT^\7MG"lfnVWnVAAGV 
YHHNIGSPYCLRlAoAU YfNUJMl V w u V /\r%.\j t 

aoceiqehakpiqdvqwlwiqdasrdlllai 
hppnyivlwnadtqtklwkksyadnilsfsf 

D 


329 


1679 


A 


3038 


90 


744 


" SVNLPPSLWPWEEAMDSTKSEPLKGSFJbAED 
rwT¥T«wvi \ rvmcrk VT> T7C tn VTOMKWRLGEG 
RGEAVYQIGVEDNGLLVGLAEEEMRASLKTL 
HRMAEKVGADrrVLREREVDYDSDMPRKITE 
VLVRKVPDNQQFLDLRVAVUjNVDSGKSTL 

lgvltqgeldngrgrarlnlfrhlheiqsgr 
tssisfedlgfnskgevhgingtqwgqtlrmo 

w 


330 


1680 


A 


3040 


3 


397 


' LCSTaLLTIPSWVLSQITLKESGPTLMKFltr 

ltltctfsgfslntsgvgvawirqppgkale 

wlaliywdddkryspslndrltiakdtsknq 

vvltnttnmgpvdtatyycaqfargargsn 

WFDPWGO 


331 


1681 


A 


3043 


3 


1509 


" AGIRHHAPPnSNRHRRQlDRGVTHLNlSGLK 

mprgiaidwvagnvywtdsgrdvieyaqmk 
genrlcmsgmidephaivvdplrgtmywsd 
wgnhpkietaamdgtlretlvqdmqwptg 

LAVDYHNERLYWADAKLSVIGSIRLNGTDPI 

vaadskrglshpfsidvfedyiygvtyinnrv 

FKIHKFGHSPLVNLTGQLSHASDVVLYHQHK 

qpevtnpcdrkkcewlcllspsgpvctcpng 
krldngtcvpvpsptpppdaprpgtcnlqcfn 
ggscflnarropkcrcqprytgdkceldqc 
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eotide 
seq- 
uence 


SEQID 
^0:of 1 
}eptide 
seq- 
uence 


Viet 5 
lod I 
i 
1 


JEQ 1 
DNO: 1 
n \ 
JSSN 
09/496 
?14 


Predicted 

jeginning 

lucleotide 

ocation 

correspond! 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 

an\A re<:tdiie 

of peptide 
sequence 


Amino acid sequence {A=AlanineC=Cysteine, , 
[>=Aspartic Acid, BKjiutamic Acid, 
psPhenylalanine* G=Glycinei H=Histidine, 
I=IsoleucUie, K-Lyslne, LpLeudne, 
M=Methioiiine, N=AsparagiJie, P=Proline, 
QMjlutamine, R^Argtaine, S«Serine, 
T=Thrconine, V=Valine, W-Tiyptophan, 
Y^Tyrosinc, X^Unknown, *«Stop codon. 
^=possiblc nucleotide deletion, V=possible 
nucleotide insertion 














■WEHCRNGGTCAASPSOMPTCRCPTGFTOPKCJ 

TQQVCAGYCANNSTCTVNQGNQPQCRCLPG 

FLGDRCQYRQCSGYCENFGTCQMAADGSRQ 

CRCTAYFEGSRCEVNKCSRCLEGACWNKQS 

GDVTCNCTDGRVAPSCLTCVGHCSNGGSCT 

MNSKMNIPECQCPPHMTGPRCEEHVFSQQQP 

ffHTASlLIP 


332 


16S2 


A 


3045 


3 


952 


ITnSNFHTQVNRlYCCG'l'yKAGPMRQlbLVU 
AVDEEVGDYFPEFLDMLEESPFLKMUPWOT 
LSSLRLQCRSQSDDGPIMWVRPGEQMIPTAD 
MPK ^JPFKIOOISMNEDCNLO YLPRTSEPREVLF 
EDRTRAHADHVGQGFDWQSTAAVGVLKAV 
QFGEWSDQPRITKDVICFHAEDFTDWQRLQ 
T ni HFPPVSOCVOWVDEAKLNQMRREGIRY 
ARIQLCDNDIYFIPRNVIHQFKTVSAVCSLAW 
HIRLKQYHPWEATQNTESNSNMDCGLTGKR 
ELEVDSQCVRIKTESEEACTEIQLLTTASSSFP 

PASE 


'333 


1(;83 


A 


3046 


497 


167 


SACSTGPELPGKA'IRSLTRPANQKGCUUURL 
YYDGCA\aAMNGSVFAQGSQFSLDDYEVLT 
ATLDLEDVRSYRAEISSRNLAVSAPVDTCVG 
CSSKTWKVAPFVRAWWRP 


334 


1684 


A 


3053 


37 


276 


-VOTOLEEQLNQLTEDIslAJiLNNgNl' V LSKQLU 
c A Qrs AismFTVOLRSEVDHLRREITEREMQLTS 

QKOVRRVNKVVRSLEDF 


335 


1685 


A 


3054 


2 


"P6 


WDAWGDWSDCSRTCGGGASYSLRRCLTUK 

NCEGQNIRYKTCSNHDCPPDAEDFRAQQCSA 

YKDVOYOGHYYEWLPRYNDPAAPCALKCH 

AQGQNLWELAPKVLDGTRCNTDSLDMCISG 

ICQAVGCDRQLGSNAKEDNCGVCAGDGSTC 

RLVRGQSKSHVSPEKREENVIAVPLGSRSVRI 

TVKGPAHLFIESKTLQGSKQEHSFNSPGVFW 

ENTTVEFQRGSERQTFKIPGPLWIADFIFKTRY 

TAAKDSWQFFFYQPISHQWRQTDFFPCTVT 

CGGG 


336 


1686 


A 


3058 


54 


347 


WOKdEAGAHSDSCCLLHTPPRLTKAHSKKA 
LRNSRIVSQKDDVHVCIMaRAIMNYQVSRG 
AWWRLGSPACPHWGLHKLPRLWDPLSLYP 

VT PWGT 


337 


1687 


A 


3059 


2 


7uy 


■ iLTSLVELTRFETLTPRFSATVPPCWVtVwb 
OQQRRHPQHLHQQHHGDAAQHTRTWKLQT 
DSNSWDEHVFELVLPKACMVGHVDFKFVLN 
SNITNIPQIQVTLLKNKAPGLOKVNGLRLCPF 
LEDHKEDILCGPVWLASQLDLSOHAGMLTLT 
SPKLVKGMAGGKYRSFLIHVKAVNERGTEEI 
CNGGMRPWRLPSLKHQSNKGYSLASLLAK 
VAAGKEKSSNVKNENTSGTRK 


338 


1688 


A 


3060 


85 


384 


' ICAFYNYHVLELLQMLVTOGVSSQLEQJtiLUi!^ 
TM,'"irv/^\7An^rT^ T ^nRNRdCLGIXSLHETIL 
SDVNPRNTFGQLFCGSLDLFGILCVGLYRUDB 

EELNP 


339 


1689, 


A 


3063 


236 


362 


■ CFLCLSGDFMVMTIFFNVSRKFGYYAFQNYV 
PSSVTTMLSWV 


340 


1690 


A 


3065 


3 


1249 


- DLWQFTPLHHAASKNRVEVCSLLLSYUAUPl 
LLNCHNKSAIDLAPTPQLKERLAYEFKGHSLL 
OAAREADVTRIKJaiLSLEMVNFKHPQTHETA 
LHCAAASPYPKRKQICELLLRKGANINEKTKE 
FLTPUfVASEKAHNDVVEVVVKHEAKVNAL 
DNLGQTSLHRAAYCGHLQTCRLLLSYGCDPN 
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SEQID 
NO: of ' 
nucl- 
eotide 
seq- 
uence 


SEQID ] 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino &cid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
coiresponding 
to last amino 
acid residue 

01 pcpuuw 

sequence 


Amino acid sequence (A=Alanine C=Cysteinc, 
I>=Aspartic Acid, BF=01utamic Acid, 
F^henylalanine, G=GlycinekHFHistidine, 
I=Isoleucine, K=Lysinci, L^Lcucine, 
M=Mettiionine, N=Asparagine, P=Prolinei 
Q=<3tutamine, RpAi!ginine, S^^Serine, 
T=Threonine, V^Valinc, W«Tiyptophan, 
Y^Tyrosinc X«Unknown, *=Stop codon. 
A^^ssible nucleotide deletion* \=possible 
nucleotide insertion 














IISLQGFTALQMGNEl^VQQLLQEGISLGNSEA 

DRQLLEAAKAGDVETVKKLCTVQSVNCRDIE 

fiROSTPLHFAAOYNRVSWEYLLQHGADVH 

AKDKGGLVPLHNACSYGHYEVAEaVKHGA 

VVNVADLWKFTPUffiAAAKGKYKCKLLLQ 

HGADimCK>nU)GNTPlJ)LVKDGDTDIQDLLR 

GDAALLDAAKKGCLARVKKLSSPDNVNCRD 


341 


1691 


A 


3070 


1 


547 


GVLIPSFQNQLFADILAGIESVTSEHNYQTLIA 
NYNYDRDSEEESVINLLSYNIDGnLSEKYHTI 
DTVVFT P9ATTPVVELMDV0GERLDMEVGFD 
NRQAAFDMVCTMLEKRVRHKILYLGSKDDT 
pnTJnnvnnYmAMMLHNLSPLRMNPRAISSI 
HLRMOLMRDALSANPDLDGVFCTN 


342 


1692 


A 


3073 


463 


3 


RINRCRKPSDADILVPGDTISUGTTSLRIDYNIi 
IDDNRVTAEEVDaLREGEBa-APVMAKTRILR 
AYSGVRPLVASDDDPSGRNVSRGIVLLDHAE 
or^ni nf;pmTnGKlA(ITYRLMAEWAroAVC 
RKLGNTRPCTTADLALPGSQEPAKVP 


343 


1693 


A 


3075 


250 


1 


LUYLAIFAPVAMSALAGVKSVQQVRIRAAQS 
LGASRAQVLWFVILPGALPEILTGLRIGLGVG 
WSTLVAAELIAATHGLOFM 


344 


1694 


A 


3076 


2 


138 


T vrrkAVT n<3T nVA AT<3TPCCLLIGYPLAWAV 
AHSKPSTRNILLLL 


345 


1695 


A 


3078 


469 


3 .. 


LKIRGQRIELGEIDRVMQALPDVEQAVTHAC 
VINQAAATGGDARQLVGYLVSQSGLPLDTSA 
T nAHT "RPTT PPHMVPWLLOLPOLPLIANGKL 
DRKALPLPELKAQAPGRAPKAGSETIIAAAFS 
SLLGCDVODADADFFALGGHSLLAMKLAT 


346 


1696 


A 


3082 


404 


2 


QNITSKDLDVRLDPQTVPIELEQLVLSFNHMI 

ERIEDVFTRQSNFSADIAHEIRTPITNLITQrrEI 

ALSQSRSQKELEDVLYSNLEELTRMAKMVSD 

MLFLAQADNNQLIPEKKMLNLAHEVGKVFD 

OFEALPE 


347 


1697 


A 


3084 


3 


340 


"vfRTTFi^PAPTSKl YTICVHPAYRTLLEKRQALt; 
DEKAKLNGRVTAMPKTQQEIVRLTRDVESGQ 
QVYMQLLNKEQELKnEASTVGDVRIVDPAIT 
npnvT K'PKKnLIILGAI 


348 


1698 


A 


3086 


723 


10 


"TQAMVWQQKACAEDDPQLSGRHWUiAAlL 
YNIAAYPHLKGDDLAEQAQALSNRAYEEAA 
QRLPGTMRQMEFTVPOGAPrrOFLHMPKGDG 
PFPTVLMCGGLDAMQTDYYSLYERYFAPRGI 
AMLTIDMPSVGFSSKWKLTQDSSLLHQHVLK 
ALPNVPWVDHTRVAAFGFRFGANVAVRLAY 
LESPRLKAVACLGPWmiXSGLKCQQQVPE 
MYLDVLASRLGMHDASTKSSTRENH 


349 


1699 


A 


3087 


2 


249 


■ RIRSSDPEirLAGTPLHAAYLIGNTTLICAGFSV 
GFGVAMSQALGPFSLRAOVASSTLGIAQVCG 
SSLWIWLAAWGIGAWNM 


350 


1700 


A 


3099 


3 


424 


EAPEATPQPSQPGPSSPISLSAEEENAEGEVSR 
ANTPDSDITEKTEDSSVPETPDNERKASISYFK 
NQRGIQYIDLSSDSEDWSPNCSNTVQEKTFN 
KDT\aiVSEPSEDEESQGUaMARR14DDISELE 

DLSGMEDLK 


i5\ 


1701 


A 


3108 


2 


404 


IKKNHHGYQLLHRRALFEKRTRLSDYALU'U 
MFGIWMVIETELSWGAYYKAPLYSLALKCL 
ISLFTIILLGLTIVYHAREIQLFMANYGADDWR 
SALTYEPIFLILLEALRGVIHATPCRVSLSLWD 

GLDLP 
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SEQID 
NO: of 1 
nucl- 
eotide 
seq- 
uence 


SEQ ED 1 
NO: of 1 
peptide 
seq- 
uence 


^et 
iiod 


SEQ 

[DNO: 

in 

USSN 
jy/4yo 
914 


Predicted 
Qcginmng 

Mil ^1 A A^l H 0 

[lucieouue 
location 
coirespondi 
ng to first 
omino scid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 

Inratinn 

corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A«Alanme OCystdne, 
I>Aspartic Acid, E«=Glutainic Acid, 
F=4'henylalaninei Q^GIycine, H«Histidinc, 
I=:lsoleudne, K=Lysine, L^Leudne, 
M=Methioninc, N^Asparaginc, P=Proline» 
Q=GIutaminc, R«Aiginine, S=Serine, 
T-Threonine, V-Valinc, W»Tiyptophan, 
Y^^Tyiosinc. X=UnIcnown, '^Stop codon, 
/possible nucleotide deletion, \=possible 
nucleotide insertion 


352 


1702 


A 


3110 


341 


2 


AQU^VCPPQTLLTTNTSSISITAIAAEIKNPER 
VAGLHFFNPAPVMKLVEWSGLATAAEWE 
QLCELTLSWGKQPVRCHSTPOFIVNRVARPY 
YSEAWRALEEQVAAPEVl 


353 


1703 


A 


3111 


3 


LOO 


HFSLFRIAFAVFLTYNn'VGLPU'VlFLFVHHEL 
GYGNTMVGIAVGIQFLATVLTRGYAGMA 


354 


1704 


A 


3116 


367 


ns 


WQLFHLNGTFLNIGETDTESCVNGWYYDRSS 
FPFSNMTEVRGLVFLS 


355 


1705 


A 


3117 


101 


53 


~\/Txn wi tq<:ppPF1 KPVnKESEVYMKl'l^DGF 
EKFSPPILQLDEVDFYYDPKHVIFSRLSVSADL 
ESRICVVGENGAGKSTMLKLLLGDLUPVRGI 
RHAHIWLKIGYFSQHHVGAAGT*TFSACGNL 
LGTQVFLGRPEEEYVRHQLGFGMGISGELGHA 
SSLPACLGGQKEAEVAFCSDGLLPCPNFLVIL\ 
DEPTNmCHGRAIEALGPCLQTISGVGmVS 
HE* SALSRLVCREU.WVC*GRSTSPF 


356 


1706 


A 


3121 


137 


'AST' 


"RGGRDWGEHNQRLEEHQARAWQGAMDAU 
AASREHARWQGTGLAPGTRVAVAPTCVQGL 
PQERSVCRPFFSSRWREGPVWALOAGAHOKP 
RW SvjCjVKUV Vi\.\JVJj\ wr i rrvrn. 


357 


1707 


A 


3124 


1249 


229 


"MLEAPGPSDGCELSNPSASRVSCAGQ^4LliVy 
PGLYFGGAAAVAEPDHLREAGirAVLTVDSE 
EPSFKAGPGVEDLWRLFVPALDKPETDLLSH 
LDRCVAFIGQARAEGRAVLVHCHAGVSRSV 
A1ITAFLNIKTDQU»FEKAYEKLQILKPEAKMN 
EGFEWQLKLYQAMGYEVDTSSAIYKQYRLQ 
KVTEKYPELQNLPQELFAVDPTTVSQCHJCDE 
\n wppvrpuQT TTR^S^TLDHREGSGPIAFAH 
KJRMTPSSMLTTGRQAQCrSYFIEPVQWMESA 
LLGVMDGQLLCPKCSAKLOSFNWYGEQCSC 
nT? WTTP APOTHKNRVDEMKILPVLGSQTGKI 


358 


1708 . 


A 


3127 


816 


139 


EVE-aGPRTPGP/EAQSPTPGSCPGWQEPSFUr 

rPPP^LSGPGPQGAPVLGKLLPDPEETPAGKTP 

LGKHFWWGL\PVTSANFSPGAAA*FGGALSPP 

nTiHi 7GHMLLOGPPSPFRL00Q*QTPPGSHSP 

PTANREINPGPAAAADTRSCWGHKRSWRGW 

RGLAPWRLGFGSPGIP*PAPAGIP/GRPTWEGG 

KGAGGKPSETLTRSPPVWRGKRGSANGFLSW 

VOILO 


359 


1709 


A 


3132 


3 


191 


■ "HEHLLLLLLCVFLVKSQGVNDNEEGFFSARU 
HRPLDKKREDAPNLRPALADUTVCDYRAQIA 
♦AASTPKRAASIAHNAVSCR*AOIA 


360 


1710 


A 


3134 


1 


286 


■ REPPRPALLFF*DRVSLCCPGWNAWQSgLl 
AAPTSQVQ/SDSFITPSSWDYRHVPEYPANFL 
♦RQGFPMLPRLVSNSWAQTVHPPRPPKVLDL 

OA 


361 


1711 


A 


3135 


56 


1449 


" " PVPAPRVSPSARGAPGRPRLPGVRGPRHSWA 
AD^RGSRMA'PRAPAPSPTGP/APGGKK.VRGR 
VPEDPDAYEPRCSAL*V*PTHVTSPQFCDP*N 
GQIRSYFTVLLRGLNETMLVK/PLCRREP/PEA 
GPORQSTPAVTRDHRQHEDPROAGRQWDAD 
PRPSAP/PAEVATGSRPGRHMWMRLCLAAQQ 
APGLPHRTSIRPGWRRLTEPEAWARRHRRPW 
GQRGAVRPPPQGAAPPPSHQGRRTNTOPSAT 
PRLTVMSRCLAPDLKAPASGPRGWRRGMPQ 
SS/GALLWTPPPTPRGSHSPRPREAPLRAIHPA 
GPSKi^RAGASGRLPEVIYGWVTLFTPPEAGT 
F/LIPSPT*MSPALVIQPPVPPTQMGLRISGLPR 
QG*PSGAPW*LPGLAQLAFOCHLPHDEVGPP 
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SEQID 5 
NO: of 1 
nucl- 1 
eotide i 
seq- ' 
uence 


SEQID 1 I 
sfO:of \ 
peptide 
seq- 
aence 


^6t 1 i 
lod 1 

i 
1 


SEQ 1 
DNO: 1 
m 

LJSSN 
[)9/496 
914 


Predicted J 
beginnmg i 
aucleotide 
location 
correspondi 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
ocation 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A«Alanine C-Oysteine, 

rtsAcnartir Arid F=r«liitamic Acid- 

F=Phenylalanine, G=Glycine, H=Histidine, 
I=Isoleucine, K=Lysine, l>Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Argininc, S=Serine, 
T«Threonine, V=Valinc, W=Tryptophan, 
Y^Tyrosine, X=Unknown, ♦^Stop codon, 
/=possible nucleotide deletion, V=possiblc 
nucleotide insertion 














RNQSPLGNDTLSSGLPMGPRRQVWPLARVU 
GHSSPREPQVLKKPLWGQTDIAGVGSASLYP 

DNL 




;i712 


A 


3136 


1270 


274 


RVGMVLGTREVGDSTPPPSPPLYPFTGNKi' vg 

HNTWQLpSRVYrSDl-K 1 UobM i iNr v^i*- " i^vj 
CQlvW*GGSRDWEEGVEEQQVGNKFSSDGR 
VGECSRKLLG*EMLSVDrrSRYRAPSTYLLNS 
LKEGLEGLHGESCSSFLLGPSVAMNMQTAGL 

EMDICDGHFRQNuGCO I VLNJ'ur i^iuji V'^'^ 

HPEKPISPFKAQTLLNQVISVQQLPKVDKTKE 

GSIVDPLVKVQIFGVRLDTARQETNYVENNG 

FNPYWGQTLCFRVLGPDFPMLRFGKMDYDW 

KSR2^LLGKTPCPGTCMQQuYKHmLi^JNJ-»vj 

ISLRPASIFVYICIQEGLEGDES 




1.713 


c 


3139 


60 


248 


" MFAGSYGKSMFSFSKKVLNCLPKWRYHFVIA 
PAMNESPLAPHLHQHLVt' 5> Vt v L 1 u-iu v 


364 


1714 


A 


3140 


57 


418 


'SAFKTLQLPAFSLYFDLGSLKLLILRlH'i^iVJf^ 
NHKVESPRTMSPG*DPQSFLQIPQPRPPQLRV 
GLTSGUQHFHSPSSCQFPLLRGPPFPRQPPLGI 
SGASLCPVLSPPR*PLQPSSL 


365 


1715 


A 


3145 


122 


413 


LLPYPSLFVFLRQCHFV1\RLHCNGWSAHUN 
LHLPGSSDSPASAS*VAGTTGVCHHmiF\VF 
LV*TGFHYVAQAGLELLTA*S\PPQLPKWGL 

OA 


366 


1716 


A 


3150 


247 


2 


VGEKLHDIRFGNDFDMTPKAQATKEKIDKLN 
FIKIKKLCIEGYY/NREPQNGRK1FANYVS\DK 
GLMATIYEELLKLSNKLIQ 


367 


1717 


A 


3152 


3 


2367 


QKLKQNQPKRAHVEDGGSRSKQGNEgt>KK.l 

PIEKSDFAAATHPRAFYLSKPDETTNAWMSD 

SGTGLTYWKLEEKDMHHSLPETLEKTFISLSS 

TDVSPNQVLTLDPTLHMKPKQQISGIQPHGLP 

NALDDRISFSPDSVLEPSMSSPSDIDSFSQASN 

VTSQLPGFPKYPSHTKASPVDSWKNQTFQNE 

SRTSSTFPSVYTrrSNDISVNTVDEENTVMVAS 

ASVSQSQLPGTANSVPECISLTSLEDPVILSKIR 

QNLKEKHARHIADLRAYYESEINSLKQKLEA 

KEISGVEDWKITNQILVDRCGQLDSALHEATS 

RVRTLENKNNLLEIEVNDLRERFSAASSASKI 

LQERIEEMRTSSKEKDNTIIRLKSRLQDLEEAF 

ENAYKLSDDKEAQLKQENKMFQDLLGEYES 

LGKEHRRVKDALN 1 icJiM>LrUAi iKiioLfus\j\ 

MISKLEAQVKQVEHENMLSLRHNSRIHVRPS 

RANTLATSDVSRRKWLIPGAEYSIFTGQPLDT 

QDSNVDNQLEETCSLGHRSPLEKDSSP/GSSST 

SLLIKKQRETSDTPIMRALKELDEGKIFKNWG 

TOTEKEDTSNSa*/INPRQTETSVNASRSPEK 

CAQQRQKRLNSASQRSSSLPPSNRKSSTPTKR 

EIMLTPVTVAYSPKRSPKENLSPGFSHLLSKN 

ESSPIREKTYSEKATDNHVNHSSCPEPVPNGV 

KKVSVRTAWEKNKSVSYEQCKPVSVTPQGN 

DFEYTAKIRTLAETERFFDELTKEKDQIEAAL 

SRMPSPGGRITLQTRLNQVKCLSLNLL 


368 


1718 


A 


3163 


2 


2350 


' "EFKSGGCGAGLVAAGAVtVLYPASRAUliKl 
RVPGSPAPSSLPLHSPGACGTEVDMDPQRSPL 
LEVKGNIELKRPLIKAPSQIi^SGSRLKRRPDQ 
MEDGLEPEKKRTRGLGATTKITTSHPRVPSLT 
TVPOTQGQTTAQKVSKKTGPRCSTAIATGLK 
NQKPWAVPVQK5GTSG\^PMAGGKia>SKRP 
AWDUCGQLCDLNAELIUUMRTQTLDQENQ 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A^Alanine OCystcine, 
I>=Aspartic Acid, E=OIutaniic Add, 
F^Phenylalanine, G=Glycine, H=Histidinc^ 
I=Isoleucine, K<,ysine, L=Leucine, 
M=Methionine,N=Asparagine, P^Proline, 
Q=OIutamine, R^Arginine, S=Serine, 
T»Threoninc, V«Valine, W«Tryptophan, 
Y«Tyrosine, X=Unknown, ♦^Stop codon. 
^=possible nucleotide deletion, V=possible 
nucleotide insertion 














QLQDQLRDAQQQVKALGTERTTLEGHLAKV 

QAQAEQGQQELKNLRACVLELEERLSTXJEGL 

VQELQKKQVELQEERRGLMSQLEEKERRLQT 

SEAALSSSQAEVASLRQETVAQAALLTEREER 

LHGLEMERRRLHNQLQELKGNIRVFCRVRPV 

LPGEPTPPPGLLLFPSGPGGPSDPPTRLSLSRSD 

ERRGTLSGAPAPPTRHDFSFDRVFPPGSOQIJE 

VFEEIAMLVQSALDGYPVCIFAYGQTGSGKTF 

TMEGGPGODPQLEGLIPRALRHLFSVAQELSG 

QGWTYSFVASYVEIYNETVRDLLATGTRKGQ 

GGECEIRRAGPGSEELTVTNARYvrvbUliMiV 

DALLHLARQNRAVARTAQNERSSRSHSVFQL 

QISGEHSSRGLQCGAPLSLVDLAGSERLDPGL 

ALGPGERERLRETQAINSSLSTLOLVIMALSN 

KESHVPYRNSKLTYLLQNSLGGSAKMUVlr V 

NISPLEENVSESLNSLRFASKVH^VLFGTAQS 

NRKWKTDPDLCVCVCVCVCVCVCVCVCVP 

MSMYRVRGGRVAGGCFIGWKAi'ut'KAiK 


369 


1719 


A 


3165 


365 


12 


GYTSQGRWmiERGPLTANTESLHENNFNALP 
GyiRKIE*l*IYKKN*INFGOVGLLNIVKISILS/K 
lYRFDAIPVKILTRFFINLDKLILKFVLKTKIAK 
NRIKTFYIMRRKKLGUSS 


370 


1720 


A. 


3170 


393 


42 


GASISPSAVIDGVEGLKPMQEQEAQEAGFULU 
♦HMAPEQWVAPR\RLLFRLIFSVLHALnAAAA 
QSSAEEDEDPRN*GQSSEDQAPNQNGLIVIVH 
RVHVPLGAAATVPVHRSHFPR 


371 


1721 


A 


3173 


770 


310 


GNGGCGLSQIPPSHLGAFSRGSLLSRGNDFRGF 

PPHPVIFFVFSrVE\QGFTVLAR14VS]S»PCDPP 

ALASQSAGITGVSHLARPQNLYF 


372 


1722 


A 


3180 


381 


76 


RVLHHDNVPAHSSPQKREISQEFQLEIRHLP'S 
PDLAPSOCFLFLNLKNIFK\GTHFSLVDNVKK 
TVSTWLH/SQNAQFYKDRLNGWYHCLQKCL 
OHY*AYVEK 


373 


1723 


A 


3181 


410 


14101 


RKEVAGPEGKGLLLASAHTMLTPPLLLLLf'LL 

SALVAAAIDAPKTCSPKQFACRDQITCISKGW 

RCDGERDCPDGSDEAPEICPQSKAQRCQPNE 

HNCLGTELCVPMSRLCNGVQDCMDGSDBGP 

HCRELQGNCSRLGCQHHCVPTLDGPTCYCNS 

SFQLQADGKTCKDFDECSVYGTCSQLCTNTD 

GSFICGCVEGYLLQPDNRSCKAKNEPVDRPP 

VLLIANSQNILATYLSGAQVSTITPTSTRQTTA 

MDFSYANETVCWVHVGDSAAQTQLKCARM 

PGLKGFVDEHBNISLSLHHVEQMAIDWLTGN 

FYFVDDIDDRIFVCNRNGDTCVTLLDLELYNP 

KGIALDPAMGKVFFTDYGQIPKVERCDMDG 

QNRTKLVDSKIVFPHGnXDLVSRLVYWADA 

YLDYIEWDYEGKGRQTnQGILIEHLYGLTVF 

ENYLYATNSDNANAQQKTSVIRVNRFNSTEY 

QWTRVDKGGALHIYHQRRQPRVRSHACEN 

DQYGKPGGCSDICLLANSHKARTCRCRSGFS 

LGSDGKSCKKPEHELFLVYGICGRPGIIRGMD 

MGAKVPDEHMIPIENLMNPRALDFHAETGn 

YFADTTSYLIGRQKIDGTERETILKDQIHNVE 

GVAVDWMGDNLYWTDDGPKKTISVARLEK 

AACyrRKTLIEGKl^OTIPRAIVVDPLNGWMYW 

TDWEEDPKDSRRGRLERAWMDGSHRDIFVT 

SKTVLWPNGLSLDIPAGRLYWVDAFYDRIETI 

LLNGTDRKIVYEGPELNHAFGLCHHGNYLFW 

TEYRSGSVYRLERGVGGAPPTVTLLRSE^FPI 
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Amino acid sequence (A=Alaninc 0=Cystcmc. 
I>=Aspaitic Acid, EFOlutamic Add, 
F^^^ylalanine^ G^Glydne, H^Histidiae. 
I=Isoleucine, K^Lysine^ LpLcucine, 
M=Methionine, N=Asparaginc, P=Proline» 
Q=Glutamine, R=»Arginine, S=Sciine, 
T^ThreonincV-Valine.W'^iyptophan, \ 
Y^Tyrosine, X=Unknown, ♦^^top codon, 
/possible nucleotide deletioQ, V^ssible 
nucleotide insertion 














feirnmydaqhqqvgSnkcrVnnagcssjlcl 

atpgsrqcacaedqvldadgvtclanpsyvp 

ppqcqpgefacansrciqerwkcdodndcld 

nsdbapau:hqhtcpsdrpkcennrcipnrw 

lcdgdndcgnsedesnatcsartcppnqfsc 

asgrcipiswtcdldddcgdrsdesascaypt 

otltqftcnngrcininwrcdndndcgdns 

deagcshscsstqfkcnsgrcepehwtcdod 

ndcgdysdethanctnqatrppgocmdef 

qcrldglciplrwrcdgdtdcmdssdeksce 

gvthvcdpsvkfgckdsarciskawvcdgd 

ndcednsdeenceslacrppshpcalwrsvc 

lppdklcdgnddcgdgsdegelcdqcslnn 

ggcshncsvapgegivcscplomelgpdnht 

cqiqsycakhlkcsqkcdqnkfsvkcscyeg 

WVLEPDGESCRSLDPFKPniFSNRHElRRlDLH 
KGDYSVLVPGLKNTIALDFHLSQSALYWTDV 
VEDKIYRGKLLDNGALTSFEWIQYGLATPEG 
LAVDWIAGNIYWVESNLDQIEVAKLDGTLRT 
TLLAGDIEHPRAIALDPRDGILFWTDWDASLP 
RIEAASMSOAGRRTVHRETGSGGWPNGLTV 
DYLEKRILWIDARSDAIYSARYDGSGHMEVL 
RGHEFLSHPFAVTLYGGEVYWTDWRTOTLA 
KANKWTGHNVTVVQRTNTQPFDLQVYHPSR 
QPMAPNPCEANGGQGPCSHLCUNYNRTVSC 
ACPHUviKLHKDNTTCYEFKKFLLYARQMEIR 
GVDLDAPYYNYnSFTVPDIDNVTVLDYDARE 
QRVYWSDVRTQADCRAFINGTGVETVVSADL 
PNAHGIAVDWVSRNLFWTSYDTNKKQINVA 
RLDGSFKNAWQGLEQPHGLWHPLRGKLY 
VraX>DNISMA>IMDGSNRTLLFSGQKGPVGL 
AIDFPESKLYWISSGNHTINRCNLDGSGLEVID 
AMRSQLGKATALAIMGDKLWWADQVSEKM 
OTCSKADGSOSWLRNSTTLVMHMKVYDESl 
QLDHKGTNPCSVNNGDCSQLCLPTSETTRSC 
MCTAGYSLRSGQQACEGVGSFLLYSVHEGIR 
GPLDPNDKSDALVPVSGTSLAVGIDFHAEND 
TTYWVDMGLSTISRAKRDQTWREDVVTNGIG 
RVEGIAVDWIAGNIYWTDQOFDVIEVARLNG 
SFRYWISQGLDKPRAITVHPEKGYLFWTEW 
GQYPRIERSRLDGTERWLVNVSISWPNGISV 
DYQDGKLYWCDARTDKIERIDLETGENREW 
LSSNNMDMFSVSVFEDFIYWSDRTHANGSIK 
. RGSKDNATDSVPLRTGIGVQLKDIKVFNRDR 
QKGTOVCAVANGGCQQLCLYRGRGQRACA 
CAHGMLAEDGASCREYAGYLLYSERTILKSI 
HLSDERNLNAPVQPFEDPEHMKNVIALAFDY 
RAGTSPGTPNRIFFSDIHFGNIQQINDDGSRRIT 
IVENVGSVEGLAYHRGWDTXYWTSYTTSTIT 
RHTVDQTRPGAFEEUETVITMSGDDHPRAFVL 
DECQNUdFWTNWNEQHPSIMRAALSGANVL 
TLIEKDmTPNGLAIDHRAEKLYFSDATLDKIE 
RCrWXJSHRYVILKSEPVHPFGLAVYGEHIF 
WTDWVRRAVQRANKHVGSNMKLLRVDIPQ 
QPMGHAVANDTNSCELSPCRINNGGCQDLCL 
LTHQGHVNCSCRGGRE,QDDLTCRAVNSSCR 
AQDEFECANOEONFSLTCDGVPHCKDKSDE 
KPSYCNSRRCKKTFRQCSNGRCVSNMLWCN 
GADDCGDGSDEIPCNKTACGVGEFRCRDGTC 
IGNSSRCNQFVDCEDASDEMNCSATDCSSYF 
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Viet S 
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i 
\ 
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correspondi 
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residue of 
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Predicted end 
nucleotide 
location 
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to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Aiamnc OCysteme, 
[>Aspartic Acid, E=01utamic Acid, 
F=Phcnylaianine, G=Olycine, H=Histidme, 
Msoleucine, K=Lysine, L=4^ucine, 
M«Methiomne,N=Asparagine, P=Proiine, 
Q=<31utamine, R=Arginine, S=Serine, 
T^Thrconine, V«=Valine, W^Tryptophan, 
Y-Tyrosine, X«Unknown, *°Stop codon, 
/=possible nucleotide deletion. \=possiblc 
nucleotide insertion 














■RLGVKGVLFQPCERTSLCYAPSWVCDCjAWU 

CGDYSDERDCPGVKRPRCPLNYFACPSGRCIP 

MSWTCDKEDDCEHGEDETHCNKFCSEAQFE 

CQNHRCISKQWLCDGSDDCGDGSDEAAHCE 

GKTCGPSSFSCPGTHVCVPERWLCDGDKDCA 

DGADESIAAGCLYNSTCX)DREFMCQNRQCIP 

KHFVCDHDRDCADGSDESPECEYPTCGPSEF 

RCANGRCLSSRQWECDGENDCHDQSDEAPK 

NPHCTSPEHKCNASSQFLCSSGRCVAEALLCN 

GQDDCGDSSDERGCHINECLSRXLSGCSQDC 

EDLKIGFKCRCRPGFRLKDDGRTCADVDECS 

TTFPCSQRCINTHGSYKCLCVEGYAPRGGDP 

HSCKAVTDEEPFLIFANRYYLRKLNLDGSNY 

TLLKQGLNNAVALDFDYREQMTYWTDVTTQ 

GSMIRRMHLNGSNVQVLHRTGLSNPDGLAV 

DWVOGNLYWCDKGRDTIEVSKLNGAYRTVL 

VSSGLREPRALWDVQNGYLYWTDWGDHSL 

IGRlGMDGSSRSVrWTKrrWPNGL-aDYVTE 

RIYWADAREDYIEFASLDGSNRHWLSQDIPH 

IFALTLFEDYVYWTDWETKSINRAHKTTGTN 

KTLLISTLHRPMDLHVFHALRQPDVPNHPCK 

VNNGGCSNLCLLSPGGCaiKCACPTNFYLGSD 

GRTCVSNCTASQFVCKNDKCIPFWWKCDTE 

DDCGDHSDEPPDCPEFKCRPGQFQCSTGICTN 

PAFICDGDNDCQDNSDEANCDIHVCLPSQFK 

CTNTNRCffGIFRCNGQDNCGDGEDERDCPE 

VTCAPNQFQCSITKRCIPRVWVCDRDNDCVD 

GSDEPANCTQMTCGVDEFRCKDSGRCIPARW 

KCDGEDDCGDGSDEPKEECDERTCEPYQFRC 

KNNRCVPGRWQCDYDNDCGDNSDEESCTPR 

PCSESEFSCANGRCIAGRWKCDGDHDCAJDGS 

DEKDCTPRCDMDQFQCKSGHCIPLRWRCDA 

DADCMDGSDEEACGTGVRTCPLDEFQCM^ 

LCKPLAWKCDGEDDCGDNSDENPEECARFV 

CPPNRPFRCKNDRVCLWIGRQCDGTDNCGD 

GTDEEDCEPPTAHTTHCKDKKEFLCRNQRCL 

SSSLRCNMFDDCGDGSDEEDCSIDPBa-TSCAT 

NASICGDEARCVRTEKAAYCACRSGFHTVPG 

QPGCQDINECLRFGTCSQLCNNTKGGHLCSC 

AKKFMKTHNTCKAEGSEYQVLYIADDMEIRS 

LFPGHPHSAYEQAFQODESVRIDAMDVHVKA 

GRVYWTNWHTGTISYRSLPPAAPPTTSNRHR 

RQIDRGVTHLNISGLKMPRGIAIDWVAGNVY 

WTDSGRDVIEVAQMKGENRKTLISGMIDEPH 

AIWDPLRGTMYWSDWGNHPKBBTAAMDOT 

LRETLVQDNIQWPTGLAVDYHNERLYWADA 

KLSVIGSIRLNGTDPIVAADSKRGLSHPFSmV 

FEDYIYGVTYINNRVFKIHKFGHSPLVNLTGG 

LSHASDWLYHQHKQPEVTNPCDRKKCEWL 

CLLSPSGPVCTCPNGKRLDNGTCVPVPSPTPP 

PDAPRPGTCNLQCFNGGSCFLNARRQPKCRC 

QPRYTGDKCELPQCWEHCRNGGTCAASPSG 

MFrCRCFTGFTGPKCTQQVCAGYCANNSTCT 

VNQGNQPQCRCXPGFLGDRCQYRQCSGYCE 

WGTCQMAADGSRQCRCTAYFEGSRCEVNK 

CSRCLEGACWNKQSGDVTCNCTDGRVAPS 

arcVGHCSNGGSCTMNSKMMPECQCPPHM 

TGPRCEEHVFSQQQPGHIASILIPLIXLLIXVL 

VAOWFWYKRRVQGAKGFQHQRMTNGAM 

NVEIGNPTYKMYEGGEPDDVGGLLDADFAL 
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correspondi 
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amino acid 
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nucleotide 
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of peptide 
sequence 


Amino acid sequoice (A=Alanine C=Cysteine, 
D=Aspartic Acid, EpGlutamic Acid, 
F-^licnylalanine, GKjlycine, H-Histidinc, 
I=IsoleiJcinc, K=Lysine» L=Leucine, 
M=Methionine, N='Asparagine, P=Proline, 
Q=Glutainine, R^Argininc, S^Serine, 
T=Thieonine, V=Valine, W=Tiyptophan, 
Y^Tyrosine, X^Unknown, *=Stop codon, 
/=possible nucleotide deletion, \Fpossib)c 
nucleotide inseition 














DPDkFrNFTNPV Y AILVMUGHOSKHSLAS 1 U 
EKRELLGRGPEDEIGDPLA 


374 


1724 


A 


3187 


191 


1815 


CLELASAGKIPEESKALSLLAPAPTMTSLMrU 

AGLLPnTPNPLmGVSLSSLOAEPAAALDPNI 

ATLGEIPQPPLMGNVDPSKIDEIRRTVYVGNL 

NSQTTTADQLLEFFKQVGEVKFVRMAGDET 

QPTRFAFVEFADQNSVPRALAFNGVMFGDRP 

LKINHSNNAIVKPPEMTPQAAAKELEEVMKR 

VREAQSFISAAIEPGWLHSTSLCNDFLGCF*RR 

RMYRE*APCTICGTFHLCLIINWDL*LF*AYTA 

K*ITPPRVWKEQ*KKRR\RSRSHTRSKSRSSSK 

SHSRRKRSQSKHRSRSHNRSRSRQKDRRRSK 

SPHKKRSKSRERRKSRSRSHSRDJCRKDTREKl 

KEKERVKEKDREKEREREKEREKEKERGKN 

KDRDKEREKDREKDKEKDREREREKEHEKD 

RDKEKEKEQDKEKEREKDRSKEIDEKRKKDK 

KSRTPPRSYNASRRSRSSSRERRRRRSRSSSRS 

PRTSKTIKRKSSRSPSPRSRNKKDKKREKERD 

HISERRERERSTSMRKSSNDRDGKEKLEKNST 

S 


375 


1725 


A 


3192 


415 


101 


AHSSHQTRAILQEFQWDllRHPPLNSPNLALau 
F\FPNLKKSLRGTHFSSVKK\TrLTWLNSQDP 
WF/FFYP* SPDLQIPSSFRNOLNDWYHHSQKC 
PDLDGAYVKK 


376 


1726 


A 


3199 


931 


418 


GV*WCDLGSPQPPPPGFKQFCLGRSSSWDYK 
HVPPHPANFVFLLETGFLHAGQAGUGDPPAS 
ASQSAGITGVSHTWPKNHLIFYAaVIRSKRl 

K 


377 


1727 


A 


3201 


274 


1285 


KTGYTSRGSPLSPQSSIDSELSTSELEDDSISM 

GYKLQDLTDVQIMARLQEESLRQDYASTSAS 

VSRHSSSVSLSSGKKGTCSDQEYDQYSLEDEE 

EFDHLPPPQPRLPRCSPFQRGIPHSQfTFSSIREC 

RRSPSSQYFPSNNYQQQQYYSPQAQTPDQQP 

NRTNGDK/PPKKYA*PSPDAKYNCH**QHVSSP 

VTVRNSQSFDSSLHGAGNGISRIQSCIPSPGQL 

QHRVHSVGHFPVSIRQPLKATAYVSPTVQGSS 

NMPLSNGLQLYSNTGIPTPNKAAASGIMGRS 

ALPRPSLAINGSNLPRSKIAQPVRSFLQPPKPL 

S55LSTLRDGNWRDGCY 


378 


1728 


A 


3202 


112 


1789 


■ vPGVTESRPSVLRGDHLFALLSSETHQEDm 
YKGFVHKVXELDRVKLSFSMSLLSRFVGWG* 
PFKVNFY/TFNRQPLRVVQHRALELTGRWIXW 
PMLFPWAPRDVPLLPSDVKLKLYDRSLESNP 
EQLQAMRHIYTGTTRPAPYnFGPPGTQKTVT 
LVEAKQWKHLPKAHILACAPSNSGADLLC 
QRLRVHLPSSIYRLLAPSRDIRMVPEDIKPCCN 
WDAKKGEYWPAKKKLQEYRVLrraiTAGR 
LVSAQFPIDHFTHIFIDEAGHCMEPESLVAIAG 
1 x/rwrrnnprifiOT VLAGDPROLGPVLRSPL 
TQKHGLGYSLLERLLTYNSLYKKGPDOYDPQ 
FITKLLRNYRSHPTILDIPNQLYYEGELQACA 
DVVDRERFCRWAG\LPRQGFPIIFHGVMGKD 
EREGNSPSFFNPEEAATVTSYLKLLLAPSSKK 
GKARLSPRSVGVlSPYRKQVEKIRYCnKLDR 
ELRGLDDIKDLKVTCCSTVTPCLPCAPTCPLP 
ETSSSFHSSPRPRPTPAALNRARALPEPLTPGD 
SNLRVWDGIRKPACLTNTSCHS 


379 


1729 


A 


3206 


432 


130 


"PKAAPSVXLWPFFL'USFKPTKGHXXCVXUJL 
*LSTREAXDSXPOROIAXXROGGKVErrrAL 
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I=Isoleucine, K=Lysine, L?=Leucine> 
XA=Methionine. N=Aspanudne, I^Proline, 
Q=Glutamine, R='Argininei, S=Serine, 
T=Threonine, V«Valine» W-Tiyptophan, 
Y=Tyrosine, X»Unknown, ♦=Stop codon, 
/=possible Qucleotkle deletion, ^lossible 
nucleotide insertion 














■XKQSNNKGTRASSY5tePJiAXli(iVi^HK^ 
OLPGXTHE 


380 


1730 


A 


3207 


187 


507 


GGTGHPHPARPPLSGVGGCQCSHSKPWTAGS 

i>r?r\T>r»txD A DuvnTF A nnfiT PfiPOA WGG*ICGP 
PEQKDHr At rlTcV^lli/ViJViUi-J^ Vf^v vr ivvj * 

AXLLPGPGGGPGPVASLEARAQASSGVTPNG 
GGRTYPYPTFSSGE 


381 


1731 


A 


3225 


I 


840 


GTRPGHLPArSDOrCV/rlL'oirawuor \jnou 

EMQLITSLGLQEFDIARNVLELIYAQTLVWIGI 

FFCPLLPHQNOMLFIMFYSKNISLMMNFQPPS 

KAWRASQMMTFFIFLLFFPSFTGVLCTLAm 

WRLKPSADCGPFRGLPLFIHSIYSWIDTLSTRP 

GYLWVWIYRNLlGSVHFFFILTLIVLnTYLY 

WQITEGRKIMIRLLHEQimEGKDKMFLIEKLI 

KLQDMEKKANPSSLVl^RREVEQQGFLHLGE 

HDGSLDLRSRRSVQEGNPRA 


382 


1732 


A 


3238 


256 


38 


"LLMIKVSSTCFSCHLHHHHHHHHRHHQGHI^S 
LFFSLKSSSNSS1LPVYLSYNIILVFSKCLVFDF 

LFSNACL 


JO J 


1733 


A 


3241 


1542 


343 


KGAPSFVKLYQYPNFAGPHAALANKSFrKAU 

KVrMLWNKKATAVLVIASTDVDKTGASYYO 

EQTLHYIATNGESAVVQLPKNGPIYDVVWNS 

SSTEFCAVYGFMPAKATIFNLKCX>PVFDFGTG 

PRNAAYYSPHGHILVLAGFGNLILQI*AD/IMK 

VWNVKNYKLISKPVASDSTYFAWCPDQEHIL 

TATCAPRLRVNNGYKIWHYTGSILHKYDVPS 

NAELWQVSWQPFLDGIFPAKTITYQAVPSEVP 

NEEPKVATAYRPPALRNKPITNSKLHEEEPPQ 

NMKPQSGNDKPLSKTALKNQKKJiliA^ 

QEARSDKSPDLAPTPAPQSTPRNTVSQSISGDP 

EIDK3aK>aiCKJaJCAIEQUXQAATO^ 

NOLEKIOKETAIXQELEDLELGI 


384 


1734 


A 


3242 


3 


678 


IRSPAARSPGLETPrCLLF VIAAIAA V 1* ^SAIP 

RLTQHRPQDGSFPYTILDPPLYLPGQCAPPQP 

LSQCARRVHGEKLRRFTFGPRHRGAGTAKMS 

ASLVRATVRAVSKRKLQPTRAALTLTPSAVN 

KKQLLKDKPEHVOVKVGVRTROCNGLSYTL 

EYTKTKGDSDEEVIQDOVRVFIEKKAQLTLL 

GTCMDYVEDia-SSEFVFNNPNIKGTCGCGES 

FN! 


385 


1735 


A 


3243 


3190 


664 


■ VAMGTPRAQHPPPPQLLFLILLSCPWIQULPL 
KEEEILPEPGSETPTVASEALAELLHGALLRR 
GPe^GYLPGPPLGPEGGEEETTTTnTTTTVTr 
TVTSPVLCNNNISEGEGYVESPDLGSPVSRTL 
GLLDCTYSIHVYPGYGIEIQVQTLNLSQEEELL 
VLAGGGSPGLAPRLLANSSMLGEGQVLRSFT 
NRLLLHFQSPRVPRGGGFRIHYQAYLLSCGFP 
PRPAHGDVSVTDLHPGGTATFHCDSGYQLQG 
EEmCLNGTRPSWNGETPSCMASCGGTlHNA 
TLGRIVSPEPGGAVGPNLTCRWVIEAAEGRRL 
HLHFERVSLDEDNDRLMVRSGGSPLSPVIYDS 
DMDDVPERGLISDAQSLYVELLSETPANPLLL 
SLRFEAFEEDRCFAPFLAHGNVTTTDPEYRPG 
ALATFSCLPOYALEPPGPPNAIECVDPTEPHW 
NDTEPACaCAMCGGELSEPAGWLSPDWPQS 
YSPGQDCVWGVHVQEEKRIIXQVEILNVREG 
DMLTLFDGDGPSARVLAQLRGPQPRRRLLSS 
GPDLTLQFQAPPGPPNPGLGQGFVLHFKEVPR 
NOTCPELPPPEWGWRTASHGDLIRGTVLTYQ 
CEPGYELLGSDILTCOWDLSWSAAPPACQKI 
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D=Aspartic Acid, E=01utamic Acid, 
F«Piicnylalanine, G=Olycine, H=Histidinc, 
I=Isoleucine, K=Lysine, L?=Leucine, 
M-Methionine^ N=Asparagine, P^Proline, 
Q=Glutamme, R=Arginine, S=Serine, 
T»Thrconinc, V«Valinc, W^'Tiyptophan, 
Y«Tyrosinc X^Unknown, ♦HStop codon, 
/=possible nucleotide deletion, V=possible 
nucleotide insertion 














MTCADPGEIANGHRTASDAGFPVGSHVQYRC 

LPGYSLEGAAMLTCYSRDTGTPKWSDRVPKC 

ALKYEPCLNPQVreNGYQTLYKHHYQAOESL 

RFFCYEGFELIGEVTTTCVPGHPSQWTSQPPLC 

KVTQrrroPSRQLEGG>aJU-AIl^ 

SGVYrmXLQGKSLFGFSGSHSYSPrrVESDF 

SNPLYEAGDTRBYBVSI 


386 


1736 


A 


3250 


5725 


3984 


GTSTVTMATKIQffSIIU^UOMIXKKD 

RKLLMTWAl^AVVMKKSETYAPLFCLPSF 

WOCGUj^TLVEDVNICLQACSSLHALSSS 

LPDDLLQRCVDVCRVQLVHRGTCIRQAFGKL 

LKSIPLGVFLSNIWHTEIQEISLALRSHMSKAP 

SNTFHPQDFSDAaSFILYGNSHRTGBCDKWLE 

RLFYSCQRLDKRDQSnPKNLUCTDAVLWQW 

A rare A A rktJTVf C^I TJTTDT fSQ ArynTPnTTF/ilTR 

SLAGHTLNPDQDVSQWTTADNDEGHGNNQL 
RLVLIXQYLENLEKLMYNAYEGCANALTSPP 
KVIRTFLYTNRQTCQDWLTRIRLSIMRVGLLA 

MMWEALCELHCPEAIQOIAVWSSSIVGKHL 

LWINSVAQQAEGRFEKASVBYQEHLCAMTG 

vnm^QFnv Wl TLASlAnCKSASLKHCLNGE 

SRKSVLSKPTDSSPEVINYLGNKACECYISTA 

DWAAVQEWQNAIHDLKKSTSSTSLNLKADF 

NYIK^LSSFESGKFVECrEQLELLPGENINLLA 

onSKEKIDMKKLLRNM 


387 


1737 


A 


3255 


380 


76 


mdiflynckyqvqtei^nsiqhimavskklsrf 
licyvhnl*aenyktlmk*inedlnk:qrdvpy 
s*tarlnkmsiptktifrfkaiy1kipatyfiet 

NMQ 


388 


1738 


A 


3260 


685 


428 


pqwlglqvyalppanfvffvemrstilaqto 
felldssdlpasasksagitcmshhartlslk 
*wpfclsatqekfc*pasegvaw 


389 


1739 


A 


3269 


1 


332 


LDGYHTPIYMLNRIIRLPAAL*IISDQTGHALTI 

ltrletqminadyqnkltldyllttdrevye 
pfnltnycunhnqrlgaydlg*v*q/klahv 
pvqv*hgfdpeamfr 


390 


1740 


A 


3270 


2 


372 


GRCHDQNKGKSMXjPDAQAEACGGESTYQEL 
LVNQNPIGQPLACRRLTRKIYEGIKKAVKPNH 
SPROVKKVHKFVNKGEKGMVLAGDTLGIGV 
YCLLPCMC*DRKLTyAHIPSTTDLGAGAGY 


391 


1741 


A 


3273 


1 


187 


FFQEMU5IMKAISDMMGKCTYPVLKEDAPRQ 

HVETFFQVmTRSQBGMKLGENFLMFAMPP 

DDSKESKGK*FFQEMLDIMKAISDMMGKCTY 

PVLKEDAPRQHVETFFQVGINQKSRGHEVRR 

KFPDVCHAPR 


392 


1742 


A 


3281 


901 


521 


FFFGDGVSPCRQAOV'WHDLDSLQNLPPGFK 
RFSYLSLPSSW\DYRHVLPRQANFCIF/M»RRG 
FTMLARMVSIS'PRDLPALASQSAGITGVSHH 
APPQMDFTFALLCFALKGCLPRQKEGGTLNLI 


393 


1743 


A 


3283 


385 


3 


RNRSVVPEFVIXGI^AGPQTQTIJJ'VIJVVIC 
LLTVMGNIXLLVVWADSCLHTPMYFFLGQL 
SFLDLCHSSVTAPKLLENLLSEKKTISVEGCM 
A*VFFVFATGGTESSLLAVMAYDRYVAIRTR 
G 


394 


1744 


A 


3284 


575 


1054 


CmCKADCI)TCTNKNFCTKCKSGFYLHLGKC 
LDNCPEOIJEANNinMECVSIVHCEVSEWNP 
WSPCTKKGKTCGFKRGTETRVREnQHPSAKG 
NIX^PPTOETRKCTVQRKKCQKGERGKKGRE 
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+n loot omtno 

acid residue 
sequence 


Amino acid sequence (A^Alanine OOysteine; 
D=Aspartic Acid, EKjlutamic Acid, 
F=Phcnylalanine, G^Olycine, IMistidine, 
Msoleudne, K=Lysine, L=Leucine, 
M^Methionine, N^Asparagine, P^Proline, 
QKjlutamine, R«Arginine, S^Scrine, 
T=Thrconinc, V«Valinc, W^Tiyptophan, 
Y«Tyrosine, X=Unknown, ♦^Stop codon, 
/^possible nucleotide deletion, Nppossible 
nucleotide insertion 














RKRKKFNkGESKEAIPDSKSLESSKEIPfiQRliN 
KOOO 


395 


1745 


A 


3286 


1 


340 


RVLYWSMGFCn-VAHGWQKISTKSVFKKLS 
WICLSMmTHSLKTFHRNWDWESEYTLFMS 
ALKVNKNNAKLWNNVGHALENEKNFERAL 
KYFLQATHVQPDDIGAHMNVGR 


396 


1746 


A 


3293 




172 


GFRAVVMTVKTEAAKGTLTYSRMROMVAIL 
lAFMKORRMGLNDFlQKIANNSYACKQ 


397 


1747 


A 


3295 


12 


401 


AEPACGASSCTPPSLRSSSSQSVGPLRPUKFL 

WSEACAFL*AAAPQGPASPCCGLPSGFPRVW 

AQCCPPGGALRFPEGLGSVLSPRRC3PQVSRGS 

GLSAVPQEWSCjrLut^LKAUrv^tirtxORX 

GLT 


398 


1748 


A 


3300 


1912 


2768 


KQRRWQNIQRKGPKRYIVIAGNSQSHQFMlhS 

MLRKLPKVTCRDVLPEIRAICIEEIGCWMQSY 

STSFLTDSYUCYIGWTUIDKHREVRVKCVKA 

LKGLYGNRDLTARLELFTGRFKDWMVSMIV 

DREYSVAVEAVRLLILtt.KNMEGVLMDVDCE 

SVYPIV*ASN*GLASAVGEFLYWKLFYPECEI 

RTMGGREQRQSPGAQRTFFQLLLSFFVESKSH 

SVTQAGVQWQFSAHRDLaPGSSNSHVSASR 

VAGIAGAHRHTWLIYVFFSWRQGFAVLAGL 

VSNS 


399 


1749 


A 


3301 


536 


2391 


LRSYGCKAPSRISHLHIOTLFLLLPSLLMGYSE 

SPPPITDSWAPFISLTHHVLSQSQSPLSSNCW3 

CLSTHTQ*FrALPADLLTWTQSNVSLfflSYLAI 

PFLADSFLKPV/L*PGNSAKHLSFKLSSLSMVS 

GRAVALLHLIASGLTSIQTNTASSKPPIWGYU- 

STQTSFISPPPLCLSRTYPNPAHATMVGQVPQ 

SLCGLIFTL/RTPCRPSILHPNYKnSTSAWQKV 

LCFSGSPnHTSLHLTTOSSFLSFHPIPGFPAAN 

SALYVSSLKGPPCjKN V i JLror V l u i \imu^K>*3 

N/RLTVDKDNFFLSPKPNSLHQLPSQ\TPYQAL 

TGAALAGSYPIWENENTLSWIinTTYNF<XST 

PSLFFLCDTN*YLCLPANWSGTCTLVFQAPn 

NILPPNQTILISVEASISSSPIRNKWALHLITLLT 

ni riTA AT nmr A niTTSTTSYOTLKlTLSNTVE 

DMHTSrrSLQRQLDFLVGVDLQNWRVLDLLT 

TEKGGTCIYLQEECCFCVNESGIVHIAVRRLH 

DRAAEL*HQVADSWWQGSSLLRWIPWVAPF 

LGPLIFLFLLLMIGPCIFNLVSRFISQRLNCFIQ 

ASMQKHIDNIFHLCHV*YQSLRGNHSEAPEPR 

P 


400 


1750 


A 


3303 


2 


453 


Tu H«3<2f;VPGX'l'l*ARRRRRELEIATSDNQE 

YYmCQEVTtmERNDQKMIJVDLDDU4RTK 

KYLEERLEELLRDKDALWQKSDALEFQQKLS 

AEERWLGDTEANHCXDCKREFSWMVRRHHC 

RJCGRIFCYYCCNNYVLSKHGGKKERCC 


401 


1751 


A 


3304 


1 


626 


MAPQHSSLDDKVPQQASTVUWirVJL^uui^naV 

CTBHKDSLWGPGARSQPFGAHNTRLSPDSCy * 

EKIVLRALKDSRAGMPEQDKDPGVQENPDD 

ORRVPQGTGDAPSAFRPLWDNGGLSPFVSRP 

GPLERDLHAQRSEVTYNQRSQSSWMSSFPKR 

NAFVSPYSSMGQAQP/GLPKTNKGESCrWEG 

LSLSTQILG'OKPSKYIPSLCKR 


402 


1752 


A 


3305 


1678 


172 


■ melpsgpgperlfdshrlpulk;w.i.lvlllya 
pvgfcllvlrtflgihvflvscalpdsvlrrf 
wrtmcavlolvarqedsolrdhsvrvlisn 
hvtpfdhnivnllttcstvseseaesatgrfp 



175 



PCTAJSOl/03800 

WO 01/57188 



SEQ ID 
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nucl- 
eotide 
seq- 
uence 


SEQ ED 1 
MO: of 1 
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seq- 
uence 


Wet 
lod 
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[DNO: 

n 
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09/496 
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Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
add residue 
of peptide 
sequence 


Amino acid sequence (A=Aianine OCystcmc, 
CHAspartic Acid. B=Glutamic Acid, 
F^Phenylalaninc, G=Glycine, H«Histidine, 
l=Isoleucine, K=Lysine, LpLeucinc, 
M=Methionine, N^Aspaiagine, P=Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
T«Thrconine, V-Valinc. W«Tiyptophan. 
Y^Tyrosine, X=Unknown. *=Stop codon, 
/^possible nucleotide deletion, V=possible 
nucleotide insertion 














GAQLKAPLSPlJUT^DmiJPLTPILYJTCQ 

FFFFVIFLNIFLLAFSSPGSQPLLNSR*SFVCWSR 

GFMEMNGRGELVESLKRFCASTRLPPTPLLLF 

PEEEATNGREGLLRFSSWPFSIQDWQPLTLQ 

VQRTLVSVrVSDASWVSELLWSLFVPFTVY 

QVRWLRPVHRQLGEANEEFALRVQQNLVAKE 

LG\QrGTRLTPA\DKAEHMKRQRHPR\LRPQS 

AQSSFPPSPWVLSS/SDVQTGQTLGFREFKESF 

CPHVAIGWIPERPWPKTGCCKTLTIHLILL*G 

GPVSFSCPEU>IHPRGT*VPTQQASOLPSFPSYG 

PARrtGVL*HPSAOOPLTFAVKSS\WARAGRAL 

OERKOXALYEYARRRFTERRAPGGLD 


403 


1753 


A 


3307 


44 


447 


"DPSPSLLAVALGLRAGERTRSGPGySSPSUilS 
GGASAGLASSPECACGRSHFTCAVSALGECT 
CIPAQWQCDGDNDCGDHSDEDGCILPTCSPL 
FiPHmNOKriRRSWVCDSDNDCEDDSDEQD 
CPPRECEED 


404 


1754 


A 


3311 


409 


1 


PRHGWGRRVLGRDRPRLQKVKKSVKAlYir^J 
QDHVQNEEIYARVLDKFGSNFLSRDNADLGT 
AFVKFSTLTK*LSALLKNLLQGI^RNVIFTLDS 
T T If rjni xfiVKGDLKKPFDKAWKDYETKFAK 
lEKEKREREWR 


405 


1755 


A 


3322 


12 


458 


■ AAVPVENPWDDPRVRPRVRIFTWEDCIAUV^A 
KVLCNDSYGVTIDWSPKGAFIRLTSQSVGNG 
HPASKENDQMVDTIKKrnCVPIIWTYGDMVE 
PRPQMIRPAVGAKHKELWiaU4ALKKIK\IWE 
GKYTKPSOYNPNYMLELAHNDSVW 


. 406 


1756 


A 


3324 


1 


426 


LSMLSTISTEHRLSVLWPl*YCX:HCFrHLSAV 
MCVLLWALSLLQSILEWMFCSFLFSDVDSDN 
WCQILDFLTAVWLIFLIVLVLCGFrLVLLVRIIC 
GSQKMPLTRLYVTILLTGLVFLFCSLPLSIQ*F 
LLYWIEKDLDDL 


407 


1757 


A 


3328 


213 


1841 


"SGDLSPAELMMLT1GDVIKQLIEAHEQGKL>1L> 
LNKVKTKTAAKYGLSAQPRLVDIIAAVPPQY 
RKVLMPKLKAKPIRTASGIAVVAVMCKPHRC 
PHISFTGNICVYCPGOPDSDFEYSTQSYTOYEP 
TSMRAIRARYDPFLQTRHREBQLKQLGHSVD 
KVEFTV^GGTFMALPEEYRDYFIRNLHDALS 
GHTSNNIYEAVKYSERSLTKCIGITIETRPDYC 
MKRHLSDMLTYGCTRLEIGVQSVYEDVARD 
TNRGHTVKAVCESFHLAKDSGFKVVAHMMP 
DLPNVGLERDIEQFTEFFENPAFRPDGLKLYP 
TLVmGTGLYELWKSGRYKSYSPSDLVELVA 
RDLALVPPWTRVYRVQRDIPMPLVSSGVEHG 
NLRELALARIi^KDLGIQCRDVRTOEVGlQEIH 
HKVRPYQVELVRRDYVANGGWBTFLSYEDP 
DQDILIGLLRLRKCSEETFRFELGGGVSIVREL 
HVYGSWPVSSRDPTKFQHQGFGMLLMEEA 
ERIAREEHGSUKiAVioUVlJiiuNi irtlivj irs^ 
OGPYMVKMLK 


408 


1758 


A 


3335 


3 


467 


■ AIASPRAAGlRHELTSTMAAGKNKRLTK.WilJw 
KGAlOaCAV/DNIINIGKTLVTRTQRTKIASDG 
LKGRVFEESLADLQND\TDGYLLRVI*VAFTT 
ERTNQI/REVFNKLIPDSIGKDIEKACQSrYPLH 
DDFARKVKMLKKPKFELRKLMELHGEGSS 


409 


1759 


A 


3338 


7 


1252 


■ "PRWRNSARDEILLSFPQN YYIQWLNGSLmGl* 
WNLASLFSNLCLFVLMPFAFFFLESEGFAGLK 
KGIRARILETLGMLLLLALLILGIVWVASALID 
NDAASMESLYDLWEFYLPYLYSaSLMOCLL 
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nucleotide 
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correspondi 
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to last amino 
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sequence 


Amino acid sequence (A»Alamne (j^Lysteinc, 
I>=Aspartic Acid, EpGlutamic Acid, 
F=Phenylalanine, G=01ycinc, H=Histidinc^ 
I=Isoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutainiiie, R=Argimne, S=Serine, 
T^Threonine, V=Valine, W=Tryptophan, 
Y^Tyrosine, X«Unknown, ♦^Stop codon, 
r^ossiDie nucieouoe acieuon, vtaissii/iw 
nucleotide insertion 














LlXCTPVGUSRMFTVMGQlXVKJPTIl^DUjE 
QIYirrLEEEALQRFIXWAVFIRWyKYNIMELE 
QELENVKTl^TKLEKJRKKASAWEFNLVYPA 
VMVLLLIETSISVLLVACMLCLLVDETAMPK 
GTRGPGIGNASLSTFQFVGAALEnUFYLMVS 
SWGFYSLRFFGNFTPKKPDTTMTKnGNCVS 
ILVLSSALPVMSRTLGITRFDLLGDFGRFNWL 
GNFnVl^YNLlJFArmLCLVRKFTSAVREE 
LFKALGLHKLHLPNTSRDSETAKPSVNGHQK 

AL 


410 


1760 


A 


3339 


127 


1433 


'GSHRFSLASPLDPEVGPYCDTPTMRTLFNLL 
WLAIACSPVHTTLSKSDAKKAASKTLLEKSQ 
FSDKPVQDRGLVVTDLKAESWLEHRSYCSA 
KARDRHFAGDVLGYVTPWNSHGYDVTKVFG 
SKFTQISPVWLQLKRRGREMFEVTGLHDVDQ 
GWMRAVRKHAKOL\P*CLGSCLRTGLTMISG/ 
YVLDSEDEIEELSKTWQVAKNQHFDGFWE 
VWNQLLSQKRVGLIHMLTHLAEALHQARLL 
ALLVIPPAnPGTDQLGMFTHKEFEQLAPVLD 
GFSLMTYDYSTAHQPGPNAPLSWVRACVQV 
LDPKSKWRSKILLGLNFYGMDYATSKDAREP . 
WGARYIQTLKDHRPRMVWDSQVSEHFFEY 
KKSRSGRHVVFYPTLKSLQVRLELARELGVG 
VSIWELGQGLDYFYDLL»VGIAASAVDVFFSK 

PWSE 


411 


1761 


A 


3342 


74 


2701 


VATRKlAKGFTQFAmi'KUiKKTSIUU'K^^^^ 

FKGFGSFSNLPRSFTLRRSSASISRQSHLEPDTF 

EATQDDMVTVPKSPPAYARSSDMYSHMGTM 

PRPSIKKAQNSQAARQAQEAGPSCPNLVPGGV 

PDPPGLEAAKJEVNfVKATGPLEDTPAMEPNPS 

AVEVDPIRKPEVPTGDVEEERPPRDVHSERAA 

GEPEAGSDYVKFSKEKYILDSSPEKLHKELEE 

ELKLSSTDLRSHAWYHGRIPREVSETLVQRN 

GDFLIRDSLTSLGDYVLTCRWRNQALHFKJN 

KVWKAGESYTfflQYLFEQESFDHVPALVRY 

HVGSRKAVSEQSGAUYCPVNRTFPLRYLEAS 

YGLGQGSSKPASPVSPSGPKGSHMKRRSVTM 

TDGLTADKVTHSDGCPTSTSLPRPRDSIRSCA 

LSMDQIPDLHSPMSPISESPSSPAYSTVTRVHA 

APAAPSATALPASPVARRSSEPQLCPGSAPKT 

HGBSDKOPHTSPSHTLGKASPSPSLSSYSDPDS 

^TV«r>^r<kv y-tnTki/D/^CO'E71X/A ATT47XSQOnAR5!YGE 

GHYCQLQPPVRCjoKH W AA I is 1 mi^/mvo i vj& 

RLKELSENOAPEGDWGKTFTVPIVEVTSSFNP 

ATFQSLLIPRDNRPLEVGLLRKVKELLAEVDA 

RTLARHVIXVDCLVARn.GVTKEMQTLMGV 

RWGMELLTU»HG\RKLRLDLLERFHTMSIML 

AVDILGCTGSAEERAALLHKTIQLAAELRGT 

MGNMFSFAAVMGALDMAQISRLEQTWVTLR 

ORHTEGAILYEKKLKPFLKSLNEGKEGPPLSN 

TTFPHVLPLITLLECDSAPPEGPEPWGSTEHGV 

EVVLAHLEAARTVAHHGGLYHTNAEVKLQG 

FQARPELLEVFSTEFQMRLLWGSQGASSSQA 

RRVEICFDKVLTALSHKLEPAVRSSEL 


412 


1762 


A 


3347 


1 


898 


- IDRAAECRTKPLPMAVSIRGNADSIVACXVLM 
VLYLKIOU-VACAAVFYGFAVHMKIYPETYI 
LPITLHLLPDRDNDKSLRQFRYTFQACL^ELL 
KRLCNRTALMFVAVAGLTFFALSFGFYYEYG 
WEFIJEHTYFYHLTRRDIRHNFSPYFYMLYLT 
AESKWSFSLGIAAFLPQLILLSAVSFAYYRDL 
WCWFLHTSIFVTFNKVCrSOYFLWyLCIXPL 
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Amino acid sequence {A«=Alaninc C=Cysteinc, 
I>Aspartic Acid, EpGlutaraic Acid, 
F^Phenylalanlne; G=Glycinc; H«4fistidin^ 
I-Isoleucine, K-Lysine, IM^dne, 
M^Methionine, N^Asparagine, P=ProIine, 
QKjlutamine^ RFAighiine, S^erine» 
T«Thrconiae. V«VaIinc, W-O^iyptophan, 
Y«Tyrosine, X=Unknown. ♦^Stop codon, 
A=possible nucleotide deletioii, V=possible 
nucleotide insertion 














VMPLVRMPWKkA^hm4LWIGQAMW^ 

AYVLEFQGKNTFLFIWLAOIJT'LLINCSIUQII 

SHYKEEPLTERDCYD 


413 


1763 


A 


3361 


3 


474 


■piPVRWNSLEGRLLRGYEQHANDGKDYISRN 
♦DLRSWTAADMAAQITKRKWEAEEFAEQIKA 
YLEGTCVERA-RTHLENGKETLQLTEQSSQPTI 
PIVGIVAGLVLLGAWTGAWSAVMCRKKNS 
GHFLPTDRVSYSEAASSDHAQGSDVSLTACK 
V 


414 


1764 


A 


3363 


1488 


453 


unif VI K'K"K"TT TCTVNPDYDEDLVOEASSEDVL 

GVHJ^KDTERDDEMKRQLRRLRELHLYST 

WTf WOP AMKTf?! ,GVPORERDEGSLGKPLCP 

PEILSETLPGSVKKRVGFPSEDHLEEFIAEHLP 

EASNQSLLTVAHADAGTQTNGDLEDLEEHGP 

GQTVSEEATEVHMMEGDPDTLABLLIRDVLQ 

ELSSYNGEEEU)PEEVKTSLX3VPQRGDLEDLE 

EHVPGQTVSEEATGVHMMQVDPATLAKSDL 

EDLEEHVPEQTVSEEATGVHMMQVDPATLA 

vcw pnQTTTnwnOM^iASPSSAPAEEATEICnC 

VEEEVKTRKPKKKTRKPSKKSRWNVLKCWD 

IFNIF 


415 


1765 


A 


3369 


431 


315 


"IPWSWVGRLSVRKMSILF^LTYNYNAILNKTP 
PSFSPSL 


416 


1766 


A 


3373 


42 


651 


RQEKMGLGEIGASGVLRSMLKERKKQNMKG 
NGNVTLTPLLPAVQCGCHLQPAGRSPLPSSHS 
APGLCSPLHPLQFQQEASTCPSGTLQGREKAA 
PGQGRPLCSLWAGGAGAVFGERGAEGRGPSD 
QAPDPKSGPWLFPPGLGAPAEVRLHNVPmsa. 
RRPPLP*ARGK*PPNSGCPWSEGRAKQPLSCG 
PKPQCSLPSQVPGDTH 


417 


1767 


A 


3382 


2 


2061 


EAQDPRACGPDAGGRFAARDAPGNSLRPPPS 

SPP/GWPGQLRLLPRVPGSELRCGKPERGRLP 

ASPPGKIRGWPPGISKRPGLGGRSFPPGFAPRT 

WRPEARGPSVQSLPPIFSPQSAQTTAR»RPGAP 

KNAGRCGGA\RGPRLSLGPPPGPPPAPALPAR 

ASAGAGAAAAALAVGGVRGAGGARGTGGY 

GHCSGR/PTGRTGPGPQGPGPPMPARPR*AS\S 

TRGSRRGPGSRPARAAAAPRAGDHGRRPVRV 

HLRQHTAV»EPRLGDATAPPGGAAGPGAPAP 

R\GPGWDCALLPSPGPRSPRAVGCAEPEIWDP 

SPRRGTSPVPSVRSLRSEPANPRLGLPALLNSY 

PLKGPGLPPPWGPRTQTGHVXrrVQPSGSCIEH 

SKSLD/RGPWGAPPWGPSSSGLCSPKLATAGP 

PQSWGLCQIGRRRGLGGPGLKRGET/GLL+GC 

SMDHANRTKGPGVPTSNRCTSHIPGXGDGCSD 

HSSCEGHPDLHAGREMPAAPGLSELERVRFT 

VGCGGLASGISSASVSGLSPNRAGGPGQGDW 

EMYPVSWQTQESGGQGySPKTGR*VGMLQA 

GAGSLQOGTODGVWGLWEDGP/RG*DSPLPS 

GTGTEP*TPTTSIPFFPQPSGVYPSRATLLPMPS 

Y*ALGPSANKSEKPLLSFLYRGLCCRISLQLA 

KGIGQLSEIPLLNVETAFWSMWVTYFRK 


418 


1768 


A 


3398 


304 


2121 


F.F.P.T?FP.F.nKDDDDNNEEEEFECYPPGMKVOV 
RYGRGKNQKMYEASIKDSDVEGGEVLYLVH 
YCGWNVRYDEWKADKIVRPADKNVPKIKH 
RKKKNKLDKEKDKDEKYSPKNCKPPALGPN 
PPFQTNPISWKWYPKLDLTDAKNSDTAHIKSI 
ETTSILNGLQASESSAEDSEQEDERGAQDMDN 
NGKEESKIDHLTNNRNDUSKEEQNSSSLLEE 
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sequence 
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corresponding 
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add residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine OCysteine, 
OAspartic Acid, E=01utamic Add. 
F=Phcnylalanine, G=Glycine. H=Histidine, 
I=Isoleucine, K=Lysine, L^Leucine, 
M-Methioiune, N=Asparagine, P^Ptoline, 
Q=<jlutamine, R=Aiginine, S=Serine, 
T«Thrconine, V«Valinc, W=Tiyptophan, 
Y^^rosine, X^Unknown, *=Stop codon, 
/=possible nucleotide deletion, \Fpossible 
nucleotide insertion 














l^VHADLVISKPVSKSPERLRKDIEVLSEDTl) 

YEEDEVTKKWa)VKKDTTDKSSKPQlKRGKR 

RYCNTEECLKTOSPGKKEEKAKNKESLCMEN 

SSNSSSDEDEEETKAKMTPTKKYNGLEEKRK 

SLR1TGFYSGFSEVAEKRIKLLNNSDERLQNS 

RAKDRKDVWSSIQGQWPKKTLKELFSDSDTE 

AAASPPHPAPEEGVAEESLQTVAEEESCSPSV 

ELEKPPPVNVDSKPIEEKTVEVNDRKAEFPSS 

GSNFSA*IPLPYLHLNRLHQSL*QKGSRQQSS 

VTVSEPLAPNQBEVRSIKSETDSTIEVDSVAGE 

LQDLQSERE»LASRF*CQCELKQ**SARTRTS* 

KSLYRSEKSERCSGRRKFIKKAEiCKP*SNSGK 

OOKEGK 


419 


1769 


A 


3399 


206 


463 


QRECLSIHIGQAGIQIGDACWELYCLEHGigi' 
NGWLDTQQDQLENAKMEHTNASFDTFFCE 
TRAGKHVFRAUFVDLEPTVIDGIR 


420 


1770 


A 


3408 


1010 


685 


■RRLSFFPIWSSVLVTQARVQWRDLGSPQFLi' 
PGFKRFSCLSLPSSWDYRHPSPRPVNF/HVFLV 
VMGFHHVGQAGLELLTSGDLPALASQSARTT 
QVNHCAOPRGHFH 


421 


1771 


A 


3409 


355 


1326 


AbSNLIESCWQELGLGPWOGDWRVEQVCiAb 

ASLRFPREVCSIRFLFTAVSLLSLFLSAFWLGL 

LYLVSPLENEPKBMLTLSEYHERVRSQGQQL 

QQLQAELDKLHBCEVSTVRAANSERVAKLVF 

QRLNEDFVRKPDYALSSVGASIDLQKTSHDy 

ADRNTAYFWNRFSFWNYARPPTVILEPHVFP 

GNCWAFEGDQGQWIQLPGRVQLSDnLQHP 

PPSVEHTGGANSAPRDFAVFFLLSFFTHQGLQ 

VYDETEVSLGKFTFDVEKSEIQTFHLQNDPPA 

AFPKVKIQILSNWGHIWTaYRVRAHGVRT 

SEGAEGSAOGPH 


422 


1772 


A 


3412 


Z 


421 


EFDAQPSIQALVVFKRP*A'nUSDPGPKRCjMN 

YLVSCSMRSPESGKGEPGTARDYTPMGRPPP 

PVPSVSPGPLPGSLAIAPHSPEPHPWEQQPPRG 

QARSPPGGWLGSAT/RVRRPHNHP/RGH/HSP 

VDTAGAPASPGPDVCE 


423 


1773 


A 


3420 


91 


706 


DAQRAIYSSVGPAVSLRQRQQDGAVKlJbUK/ 
RGGVRSFSRAAAAMAPIKVGDAIPAVEVFEG 
EPGNKVNLAELFKGKKGVLFGVPGAFTPGCS 
KIHLPGFVEQAEALKAKGVQWACLSVNDA 
FVTGEWGRAHKAEGKVRLLADPTGAFGKET 
DaLDDSLVSIFGNRRLKRFSMWQDOIVKA 
LNVEPDGTGLTCSLAFNHSQL 


424 


1774 


A 


3421 


4 


7688 


" RQVTRVGTRVLGSrrAAVFLSVEDDNUNAPQ 
FSEKRYWQVREDVTPGAPVLRVTASDRDKG 
SNAWHYSIMSGNARGQFYLDAQTGALDW 
SPLDYETTKEYTLRVRAQDGGRPPLSNVSGL 
VTVQVLDINDNAPIFVSTPFQATVLESVPLGY 
1 VI HVnAlDADAGDNARLEYRLAGVGHDFP 
FnNNGTGWISVAAELDREEVDFYSFGVEAR 
DHGTPALTASASVSVTALDVNDNNPTFTQPE 
YTVRLNEDAAVOTSWTVSAVDRDAHSVrrY 
QirSGNTRNRFSrrSQSGGGLVSLALPLDYKLE 
RQYVLAVTAaJGTRQDTAQIVVNVTDANTH 
RPVFQSSHYTVNVNEDRPAGTTVVLISATDE 
DTGENARrnTMEDSIPQFRIDADTGAVrrQA 
ELDYEDQVSYTLArrARDNGJPQKSDTTYLEI 
LVNDVNDNAPQFLRDSYQGSVYEDVPPFTSV 
LOISATDRDSGLNGRVFYTFOGGDDGDGDH _ 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspond! 

ng to first 

ainino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence {A«Alanine (X^ysteine, 
D=Aspartic Acid, EKJlutamic Acid, 

henylalanine, G=Glycine. H=Histidinc; 
Msoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
•p=Threonine» V«Valine, W^Tiyptophan, 
Y-Tyrosinci, X«XJnknown, *«Stop oodon, 
/=possible nucleotide deletion, V=possible 
nucleotide insertion 














VESTSGIVRTLRRLDRENVAQYVLRAYAVDK 

GMPPARTPMEVTVTVLDVNDNPPVFEQDBFD 

VFVEK^SPIGLAVARVTATDPDEGTNAQIMY 

QIVEGNIPEVFQLDIFSOELTALVDLDYBDRPB 

YVLVIQATSAPLVSRATVHVRLLDRNDNPPV 

UjNFEnJTNNYVTNRSSSEPGGAIGRWAro 

DISDSLTYSFERGNELSLVLLNASTGELKLSR 

ALDNNRPLEAIMSVLVSDGVHSVTAQCALRV 

TirroEMLTHSITLRLEDMSPERFLSPLLGLFIQ 

AVAATLATPPDHVWFNVQRDTDAPGGHILN 

VSLSVGQPP(PGGGPPFLPSEDLQERLYLNRS 

LLTAISAQRVU^FDDMCLREPC^mdRCVSV 

LRFDSSAPFIASSSVLFRPIHPVGGLRCRCPPGF 

TGDYCETEVDLCYSRPCGPHGRCRSREGGYT 

CIXSRDOYTGEHCEVSARSGRCTPOVCKNGGT 

CVNLLVGGFKCDCPSGDFEKPYCQVTTRSFP 

AHSFTTFRGLRQRFHFTLALSFATKERDGLLL 

YNGRFNEKHDFVALEVIQEQVQLTFSAGEST 

TTVSPFVPGGVSDGQWHTVQLKYYNKPLLG 

QTGLPQGPSEQKVAWTVDGCDTGVALRFGS 

VLGNYSCAA\QGTQGGSKKSLDLTGPLLLGG 

VPDLPESFPVRMRQFVGCMRNLQVDSRHIDM 

ADHANNGTVPGCPAKKNYCDSKTCHNGGTC 

VNQWDAFSCECPLGFGGKSCAQEMANPQHF 

LGSSLVAWHGLSLPISQPWYLSLMFRTRQAD 

GVLLQATTRGRSTITLQLREGHVMLSVEGTGL 

QASSLRLEPGRANDGDWHHAQLALGAIGGP 

GHAILSFDYOQQRAEGNLGPRLHGLHLSNITV 

GGIPGPAGGVARGFRGCLQGVRVSDTPEGVN 

SLDPSHGESINVEQGCSLPDPCDSNPCPANSY 

CSNDWDSYSCSCDPGYYGDNCTNVCDLNPC 

EHQSVCTRKPSAPHGYTCECPPNYLGPYCET 

RIDQPCPRGWWGHPTCGPCNCDVSKGFDPDC 

NKTSGECHCKENHYRPPGSPTCLLCDCYPTG 

SLSRVCDPEDGQCPCKPGVIGRQCDRCDNPF 

AEVTTNGCEVNYDSCPRAIEAGIWWPRTRFG 

LPAAAPCPKGSFGTAVRHCDEHRGWLPPNLF 

NCTSITFSELKGFAERLQRNESGLDSGRSQQL 

ALLLRNATQHTAGYFGSDVKVAYQLATRLL 

AHESTQRGFGLSATQDVHFTENLLRVGSAIX 

DTANKRHWELIQQTEGGTAWLLQHYEAYAS 

ALAQNMRHTYLSPFTIVTPNIVISVVRLDKGN 

FAGAKLPRYEALRGEQPPDLETTVILPESVFR 

ETPPWRPAGPGEAQEPEELARRQRRHPELSQ 

GEAVASVIIYRTLAGLLPHNYDPDBCRSLRVPK 

RPIINTPWSISVHDDEELLPRALDKPVTVQFR 

LLETEERTKPICVFWNHSILVSGTGGWSARGC 

EWFRNESHVSCQCNHMTSFAVLMDVSRRE 

NGEILPLKTLTYVALGVTLAALLLTFFFLTLL 

RILRSNQHGIRRNLTAALGLAQLVFLLGINQA 

DLPFACrVIAILLHFLYLCTFSWALLEALHLY 

RALTEVRDVNTGPMRFYYMLGWGVPAFTTG 

LAVGLDPEGYGNPDFCWLSIYDTLIWSFAGP 

VAFAVSMSVFLYILAARASCAAQRQGFEKKG 

PVSGLQPSFAVLLLLSATWLLALLSVNSDTLL 

FHYLFATCNCIQGPFIFLSYWLSKEVRKALK 

LACSRKPSPDPALTTKSTLTSSYNCPSPYADG 

RLYQP\YGDSAGSLHSTSRSGKSQPSYIPFLLR 

EESALNPGVQGPPGIXKjIPGR/LCTLGRFKDQQ 

IM)S*TRDFDSDLSLEDDOSGSYASTHSSDSEE 
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SEQ ID 
NO; of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
914 


Predicted 
beginning 
nucleotide 
location 

mrrp^nondi 

ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
tolast amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A«Alanine C^Cysteine, 
D^Aspartic Acidi EK}Iutamic Acid, 
F=Phenylalanine» G=01ycine, H=Histidine, 
Msolcucine, K=Lysine^ D=Leucine, 
M=Methioninc, l»^Asparagine. P^Prolinc, 
Q=Glutamine, R=Aigiiune, S=Scriiie» 
T«Threoninc, V-Valine, W^Tiyptophan, 
Y=Tyrosine, X=Unlaiown, ♦«=6top codon, 
/=possible nucleotide deletion, V=possible 
nucleotide insertion 














EEEEEEEEAAFPGEQGWDSLLUPUAbKLl'l-MS 
TPKDGGPGPGKAPWPGDFGTTAKESSGNGAP 
EERLRENGDALSREGSLGPLPGSSAQPHKGDL 
KKKCLPTISEKSSLLRLPLEQCTQSSROSSASE 
GSRGGPPSRPPPRQSLQEQLNGVMPIAMSnCA 
GTVDEDSSGSEFLFFNFLH 


423 


1775 


A 


3429 


155 


1417 


GEPAVQSCDCGCTQRSCPWLLVAPGLLSSSSS 
RAASVREAEDAPLQPASIHPVSQGSRGPEGSL 
n<3AFrT ponPT GARRATRAHSPVPGPPPSLPA 
AGTAVKRGLQPG*GA/GATSTPGTGAATGGL 
CGPAWAAPSAVGPCCCCPSISTTPSQMRSARP 
SLGCLPSWAS\PGTEHPPGPQGPGPS»DLCSV* 

krefqrgpwagmvilhrisaadparapgpds 
nlqsalqqpatgcsepaavysppiglwga**p 
eyg*pqhslpg*tapadr*pwgikdrvysnsi 
yellengqragtcvlbyatplqtlfamsqys 
qagfsredrleqaklfcrtlediladapesqn 
ncrliayqepaddssfslsqevlrhlrqeeke 

EVTVGSLKTSAVPSTSTMSQEPELUSGMEKP 

lplrtdfs 


426 


1776 


A 


3431 


1662 


369 


■ aiwwlswlqhdllptptqvaidftasngdpk 
ssqslhclsprqpnhylqalravggicqdyd/ 

c-v/r^TJcnArsnTJTjnfTr;! AORlPOrFl LPSDKRFP 

afgfgarippnfevg*mrgkegdggrvsqae 

kagphcsrlaltgvshdfainfdpenpecegk 

rgdfhlprlpadtlhtgaqtplpraqlpvpst 

hprpvfineisgviasyrrclpqiqlygptnvap 

unrvaepaqreqstgqatkysvllvltdgv 

vsdmaetrtaivrasrlpmsinvgvgnadfs 

dmrlldgddgplrcprgvpaardivqfvpfr 

dfkdvsppgpfrlkdssashppksdlrlppfd 

vllrtrepswpp*sptspsddpasptlpltpnhi 

tvptl\aapsalakcvlaevprqvveyyasq 

GISPGAPRPCTLATTPSPSP 


427 


1777 


A 


3446 


79 


9748 


gcqscwpawprlrrrgpasagarlgrkafW 
glpgrvqdgrplrfcfylrprapfiapvlsga 
asrpeasgdcragretamatleklmkafesl 

KSFQQQQQQQQQQQQQQQQQQQQQQQPPPP 

pppppppqlpqpppqaqpllpqpqpppppppppp 

gpavaeeplhrpkkelsattckdrvnhcltic 

entvaqsvrnspefqkllgiamelfllcsdda 

esdvrmvadeclnkvkalmdsnlprlqlel 

ykeibmjgaprslraalwrfaelahlvrpqk 

crpylvnllpcltrtskrpeesvqetlaaavp 

kimasfgnfandnbikvllkafianlkssspn 

rrtaagsavsicqhsrrtqyfyswllnvllo 

llvpvedehstllelgvlltlrylvpllqqqv 

kdtslkgsfgvtrkemevspsaeqlvqvyel 

txhhtqhqdhnwtgalellqqlfrtpppel 

lqtltavggigqltaakeesggrsrsgsiveli 

agggsscspvlsrfcqkgkvllgeeealedds 

esrsdvsssaltasvkdeisgelaassgvstpg 

saghdiiteqprsqhtlqadsvdlascdltss 

atdgdeedelshsssqvsavpsdpamdlndg 

tqasspisdssqtttegpdsavtpsdsseivld 

gtdnqylglqigqpqdedeeatgilpdeasea 

FRNSSMALQQAHLLKNMSHCRQPSDSSVDKF 
VLRDEATEPGDQENKPCRIKGDIGQSTDDDS 
APLVHCVRLLSASFLLTGGKNVLVPDRDVRV 
SVKALALSCVGAAVALHPESFFSKLYKVPLD 
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SEQ ID 
NO: of 
dug]' 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod. 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A«Alaninc OCysteine, 
D=Aspartic Acid, E^Kjlutamic Acid, 
F^Phenylalanine^ Q=Glydn&, H=Histidine, 
I==Isoleucinc, K=Lysine, L=Leucine, 
M=Methioniiie, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
T=Threonine5, V«Valinc, W-Tiyptophan, 
Y=Tyrosinc, X=Unknown, ♦s^top codon, 
/=possible nucleotide deletion, V=possible 
nucleotide insertion 














TTEYPEEQYVSDILNYIDHOUPQVRGATAILU 

GTLICSILSRSRFHVGDWMGTIRTLTGNTFSL 

ADOTLLRKTLKDESSVTCKLACTAVRNCVM 

SLCSSSYSELGLQLHDVLTLRNSSYWLVRTEL 

LETLAEIDFRLVSFLEAKAENLHRGAHHYTGL 

LKLQERVLNNWIHLLGDEDPRVRHVAAASL 

IRLVPKLFYKCDQGQADPWAVARDQSSVYL 

KLLMHETQPPSHFSVSTrrRIYRGYNLLPSITD 

\rrMENNLSRVIAAVSHELTrSTTRALTFGCCE 

ALCLLSTAFPVCIWSLGWHCGVPPLSASDESR 

KSCTVGMATNflLTIXSSAWFFlDLSAHQDAL 

ILAGNLLAASAPKSLRSSWASEEEANPAATK 

QEEVWPALGDRALVPMVEQLFSHLLKVINIC 

AHVLDDVAPGPAIKAALPSLTNPPSLSPIRRK 

GKEKEPGEQASVPLSPKKOSEASAASRQSDTS 

GPVTTSKSSSlXiSran-PSYUaJIDVLKATHA 

NYKVTLDLQNSTEKFGGFLRSALDVLSQILEL 

ATLQDIGKCYEEILGYLKSCySREPMMATVC 

VQQLLKTLFGTNLASQFDGLSSNPSKSQGRA 

OiaGSSSVRPGLYHYCFMAPYTHFTQALADA 

SLRNMVQAEQENDTSOWFDVLQKVSTQLKT 

hn-TSVTTO^RADKNAIHNHIRlJ^PLVI^^ 

YmrCVQLXJKQVLDLLAQLVQLRVNYClX 

DSDQVFIGFVLKQFEYIEVGQFRESEAIIPNIFF 

FLVLLSYERYHSKQHGIPKnQLCDGIMASGR 

KA\n:HAIPALQPIVHDLFVLRGTNKADAGKE 

LEIX5KEVWSMLLRLIQYHQVLEMFILVLQQ 

CHKENEDKWKJEaSRQIADHLPMLAKQQMHI 

DSHEAIX5VLNTLFEILAPSSLRPVDMLLRSMF 

VTPNTTVIASVSWQLWISGILAILRVLISQSTED 

IVLSRIQELSFSPYUSCTVINRLRDGDSTSTLE 

EHSEGKQKNLPEETFSRFLLQLVGILLEDIVT 

KQLKVEMSEQQHTFYCQELGTLLMCLIHIFKS 

GMFRRITAAATRLFRSDGCGGSFYTLDSLNLR 

ARSMTnWALVLLWCQILLLVNHTDYRWW 

AEVQQTPKRHSLSSTKLLSPQMSGEEEDSDLA 

AKLGMCNRETVRRGALILFCDYVCQNLHDSE 

HLTWLIVNHIQDUSLSHEPPVQDFISAVHRNS 

AASGLFIQAIQSRCENLSTPTMLKKTLQCLEGI 

HLSQSGAVLTLYVDRLLCTPFRVLARMVDIL 

ACRRVEMLLAANLQSSMAQLPMEELNRIQEY 

LQSSGLAQRHQRLYSLLDRJFRLSTMQDSLSPS 

PPVSSHPLDGDGHVSLETVSPDKDWYVHLVK 

SQCWTRSDSALLEGAELVNRIPAEDMNAFM 

MNSEFNLSLLAPCLSLGMSEISGGQKSALFEA 

AREVTLARVSGTVQQLPAVHHVFQPELPAEP 

AAYWSKLNDLFGDAALYQSLPTLARALAQY 

L\WSKLPSHLHLPPEKEKDIVKFVVATLEAL 

SWHUHEQIFLSLDLQAGLDCCCLALQLPGL 

W^WSSTEFVTHACSLIYCVHFILEAVAVQPG 

eollsperrtntpkaiseeeeevdpntqnpkyi 
tIvcemvaemveslqsvlalghkrnsgvpa 

FLTPLLRNIIISIARLPLVNSYTRVPPLVWKLG 

wspkpggdfgtafpeipveflqekevfkefiyr 
intlgwtsrtqfeetwatagvlvtqplvme 

OEESPPEEDTERTQINVLAVQAITSLVLSAMT 

>^vagnpavscleqqpiwkplkaldtrfgrk 
lsiiroiveqeiqamvskreniathhlyqawd 
pwslspattgausheklllqinperelgsms 
yklgqvsmsvwlgnsitplreeewdeeeeee 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 1 
NO: of, 
peptide 
seq- 
uence 


Met 
hod 


SEQ 

[DNO: 

in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A~Alanine OOysteinc, 
DrAspartic Acid, EF=Glutamic Acid, 
F=Phenylalanine, GKilycine, H=Histidinc, 
Msolcucine, K^Lysine, L^Leucine, 
M=Methioninei hNAsparagine, P=Pioline, 
QKjlutamine, R=Arginine, S=Serine, 
T^Threonme, V=Valinc, W=Tiyptophan, 
Y=Tyrosinc X«Unknown. ♦-Stop codon. 
/^possible nucleotide deletion, V=possible 
nucleotide insertion 














ADAPAPSSPPTSPVNSRKHRAGVDIHSCSQFJL 

LELYSRWILPSSSARRTPAILISEVVRSLLWS 

DLFTERNQFELMYVTLTELRRVHPSEDEILAQ 

YLVPATCKAAAVLGMDKAVAEPVSRLLESTL 

RSSHLPSRVGALHGBLYVLECDLLDDTAKQLI 

PVISDYLLSNLKGIAHCVNIHSQQHVLVMCAT 

AFYLIENYPLDVGPEFSASIIQMCGVMLSGSE 

ESTPSIIYHCALRGLERLLLSEQLSRLDAESLV 

KLSVDRVNVHSPHRAMAALGLMLTCMYTG 

ifPifV<5PrfllTSDPNPAAPDSESVIVAMERVSVL 

FDRIRKGFPCEARWARILPQFLDDFFPPQDIM 

NKVIGEFLSNQQPYPQFMATWYKVFQTLHS 

TGQSSMVRDWVMLSLSNFTQRAPVAMATWS 

LSCFFVSASTSPWVAAILPHVISRMGKLEQVD 

VNLFCLVATDFYRHQIEEELDRRAFQSVLEV 

VAAPGSPYHRLLTCLRNVHKVTrC 


428 


1778 


A 


3449 


3 


430 


isjcpp<iP<iAAT VFVT LRSGSTKPHTVSGGWAA 

WGPWSSCSRDCELGFRVRKRTCTNPEPRNGG 

LPCVGDAAEYQDCNPQACPVRGAWSCWTS 

WSPCSASCGGGHYQRTRSCTSPAPSPGEDICL 

GLHTEEALCATQACPEGWS 


429 


1779 


A 


3464 


583 


3 


DALDRRYLERCHPAAGGWVGEGE" AL<jyK.l/ 

RFSGVLEPPLPSLKDGGRFPAWT*RSCSKSLR 

AAFTSQFFPSRRSRASPGSAPVGNGQNLTEQHP 

CPGSCDPQVLSASWM*VEHRSKFRPPP*NSTI 

PPES/RS*QGGTVQTGQHSSGREAGSWRARGR 

NAGRR^KGGGKIGIKQGAVRARKECRGEMA 

SGETDSE 


430 


1780 


A 


3473 


2802 


270 


FRMRIFLHCPWNQQMWKlWNLLKTSLfcSUlLA 

HLSIQKLLKERNQNQLPVFKHRDSIVETLKRHR 

VVWAGET\GSGKSTQVPHFULBDLLLNEWB 

ASKa^^IVCTQPRRISAVSLANRVCDELGCENG 

PGGRNSLCGYQIRMESRAGESTRLLYCTTGV 

LLRKLQEDGLLSNVS/HMFIVDEV\HER\SVQS 

DFLUILKEILQKRSDLHLILMSATVDSEKFST 

YFTHCPILRISGRSYPVBVFHLEDIIEETGFVLE 

KDSEYCQm^EEEVTINVTSKAGGIKKYQE 

YIPVQTGAHADLOTFYQKYSSRTQHAILYMN 

PHKINLDLILELLAYLDKSPQFRNIEGAVLIFL 

PGLAHIQQLYDLLSNDRRFYSERYKVIALHSI 

LSTQDQAAAFTLPPPGVRKIVLATNIAETGrn 

PDWFVIOTGRTKENKYHESSQMSSLVETFVS 

KASALQRQGRAGRVRDGFCFRMYTRERFEG 

FMDYSWEILRVPLEELCLHIMKCNLGSPEDF 

LSKALDPPQLQVISNAMNLLRKIGACELNEPK 

LTPUjQHLAALPVNVKIGKMLIFGAIFGCLDP 

VATLAAVMTEKSPFrnPIGRKDEADLAKSAL 

AMADSDHLTIYNAYLGWKKARQEGGYRSEI 

TYCRRNFLNRTSLLTLEDVKQELKLVKAAGF 

SSSTTSTSWEGNRASQTLSFQfilALJJSA V L v a 

GLYDNVGKnYTKSVDVTEKLACrVETAQOK 

AQVHPSSVNRDLQTHGWLLYQEKIRYARVY 

LRETTLITPFPVLLFGGDIEVQHRERLLSIDGW 

ryPQAPVmVIFKQLRVUDSVLRKKLENPK 

MSLENDKILOniELIKTENN 


431 


1781 


A 


3474 


I 


441 


FRPAPGHVQP*GGSSAAAGGGLLSHPRKW 

PCPPAPAPSRPRSLGSLGQRVPAALATAAQEL 

PATLGGDGGKPALTAGEAALPGLHRSGVPAA 

AARC*PCT/SRPT*STLSPTQAAWWCRPSRRQ 

QROEASIGGASGRRCGSa^OV 
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SEQID 
NO: of 1 
nucl- 
eotide 
seq- 
uence 


SEQID 1 
m:of \ 
peptide 
seq- 
uence 


Viet 
lod 


5EQ 

[DNO: 

in 

09/496 
914 


Predicted 

beginning 

nucleotide 

location 

conespondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
^quence 


Amino acid sequence (A^AIanbe C=Cysteine, 
I>Aspartic Acid, E=Glutamic Acid. 
F=Plienylalanine, G=Glycine, H=Histidine, 
l=Isoleucine, K=Lysine, L«=Lcucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q»GlutaminQ» R»Arghine, S'^'Serine, 
T«Threonine, V-Valinc W^Tiyptophan. 
Y=<ryrosine. X«4Jnknown. *=Stop codon, 
/=possible nucleotide deletion, V=possible 
nucleotide insertion 


432 


i7to 


A 


3478 


416 


23 


QLRRLTLPNFKTY/YSS*IIElAWH»»KNMgiU 
QWFRRESPEIDLCXYS*LSFDKEAKJ\IK/WKE 
CSLFNKWaYKNWM/LHVQKKW*VQTLHPS 
QKlX\SKWIKDLNVECaUTKIXDQEYPGDLGY 

SRALNSGSR 


433 


1783 


A 


3504 


1876 


"552 


■ CLAPCSPQPEKNGMQPLUiLlRPLLYQQLLHti 
SLGAPGBSTLLVRTSKLLVOLOLQLLVWLLL 
QTRSLLALQLHLTSSAPLLAAPTAVCSCSRCS 
APRSRCVARPAARTGLPTPAPASSPAPAASPA 
PAASPAPAESTAVPQPLILLPKP/PPAPGAPPPRP 
GAPPPRPAASPSPAASPAPPAASPVLTASPPLP 
AASPSPAASPAPPAASPVLTASFPLPAASPSPA 
ASPAPPAASPVLTASPPLPAASPALAASPVHT ' 
ASPPVHVASPPVHTASPPVHVASPPVHTASPP 
VHVASPPVHTASPHVHVASPPVHVASPPVHV 
ASPPVHTASPPVHVASPPVHTASPHVHVASPP 
VHTASPPVHVASPPVHVASPPVHVAYPPVHV 
ASPPVHVASPPVHVASPPVSCSGDSTSDCFIT 
OPGAVFPHSLAPSLGGWSHLVAALP 


434 


1784 


A 


3516 


142 


590 


GGVNRPRSEreQVKTPVLlSSWDYRHPFfKi'A 
SFPWLVnXJnTALARMNTUSWPCDLPTSASQ 
SAGrXGVRHHAVRLLYFEQESHSVTQAGWWQ 
WHNLGSLQPLSLEDRLSPGVLGCSALCRSGV 
RTKFGINMVTSRERGTTRLPKEG 


435 


1785 


A 


3529 


1 


3161 


MSLVRAALEALDELDLFQVKGGPQSYIHVLA 

DEVQHCQSILNSLLPRASTSKEVDASLLSWS 

FPAFAVEDSQLVELTKQEIITKLQGRYGCCRF 

ljmGYKTPm)PNRLYY/ENPAELKLFENIEC 

EWPLFWTYFILDGVFSGNAEQVQEYKEALEA 

VLKGKNGVPLLPELYSVPPDRVDEEYQNPHT 

VDRVPMGKLPHMWGQSLYILGSLMAEGFLA 

PGEIDPLNRRFSTVPKPDWVQVYPSLPHGCS 

SKSPSHQCrnSmTTRKrrAPVSILAETEEIKTIL 

KDKGIYVETIAEVYPIRVQPARILSHIYSSLEIF 

LPFLNSVSGCNNRMKLSGRPYRHMGVLGTSK 

LYDIRKTIFTFIPQFIDQQQFYLALDNKMIVE 

MLRTDLSYU:SRWRMTGQPT1TFPISHSMLDE 

DGTSLNSSILAALRKMQDGYFGGARVQTGKL 

SEFLTTSCCTHLSFMDPGPEGKLYSEDYDDN 

YDYLESGNWMNDYDSTSHARCGDEVARYL 

DHLLAHTAPHPKLAPTSQKGGLDRFQAAVQT 

TCDLMSLVTKAKELHVQNVHMYLPTKLFQA 

SRPSFNLLDSPHPRQENQVPSVRVEIHLPRDQ 

SGEVDFKALVLQLKETSSLQEQADILYMLYT 

MKGPDWNTELYNERSATVRELLTELYGKVG 

HRHWGLIRYISGILRKKVEALDEACTDLLSH 

QKHLTVGLPPEPREKTISAPLPYEALTQLIDEA 

SEGDMSISILTQEIMVYLAN4YMRTQPGLFAE 

MFRLRIGLnqVMATELAHSLRCSAEEATEGL 

SNVSPAISIHEIGAVGATKTERTGIMQLKSEIK 

OSPGTSNfTPSSGSFPSAYDQQSSKDSRQGQW 

QRRRRLDGALNRVPVGFYQKVWKVLQKCH 

GI^VEGFAnLPSSTTREMTPGEIKFSVHVESVL 

NRWQPEYRQUVEAn-\VLTMLADIEI\HSIGS 

nAVEKIVHIAOTU^QEQKTLXJADDTMI^ 

PASGICTLLYDSAPSGRFGTMTYLSKAAATY 

VOEFLPHSICAMQ 


436 


1786 


A 


3546 


73 


393 


" CP*LTWELLEVKKAEVLQDSLDGRYt>irsSCL 
EQPDSCRPYGRSFYALEEKHVIFSLDVGETDN 
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^0 acid sequence (A=-Alanine C=Cysteine, 
)=Aspartlc Acid, E=uiuiamic acio, 
'^Phenylalanine, G=G!ycine, H=Histidine, 
=lsoleudne, K^Lysine, L=Leucine, 
^=^ethionine, N=Asparagine, P=Proline, 
3H31utaminc, R«Arginine, S^Soine, 
r^Threoninc, V=Vaiine, W=Tiyptophan, 
f=Tyrosine, X=Unknown, *=Stop codon. 
f=possible nucleotide deletion, \=possible 
nucleotide insertion 














KGKGKTIRGl*lTKGRKGGTyQREHDANPLA 

PXSARSCWMRKG 

AVRAEPGLEELSSGLRAHSPSAITVCEPEAQU 


[437 


1787 1 
• 




3554 


5157 j 
• 1 


2939 


SASGCRYAAHPHWGLGGAAAAGGSWEPQPP 
RPVCEPAGRGKPHPPAAPRSPLLPGSRRJRPHA 
AOPGARARTSPmSAIWMAARPAATLAWSL 

llLssallregcrarfvaerdseddgeepwf 

PESPLQSPTVLVAVLARNAAHILPHFLGCLER 

LDYPKSRMAIWAATDrom)im^IHlEWLK 

NVORLYHYVEWRPMDEPESYPDEIGPKHWP 

TSRFAHVMKLRQAALRTAREKWSDYILFIDV 

DNFLTNPQTLNLLIAENKTIVAPMLESRGLYS 

NFWCGITPKGFYKRTPDYWQIREWKRTGCFP 

VPMVHSTFLIDLRKEASDKLTFYPPHQDYTW 

TFDDTIYFAFSSRQAGIQMYLCNREHYGYLPIP 

LKPHOTLQEDIENLIHVQIEAMIDRPPMEPSQ 

YVSWPKYPBKMGFDEIFMINLKRRKGQGGD 

RWLRTLYEQEIEVKIVEAVDGKALKrSQLKA 

LNIEMLPGYRDPYSSRPLTRQEIGCFLSHYSV 

WKBNHDREt^TLVIEDDVRFEHQFKKKLMK 

LMDNIDQAQLDWELIYIGRKKMQVKEPEKA 

VPNVANLVEADYSYWTLGYVISLEGAQKLV 

GANPFGKMU»VDEFLPVMYNKHPVAEYKEY 

YESRDLKAFSAEPLLIYPTHYTGQPGYLSDTC 

TSTIWDNETVATDWDRTHAWKSRKQSRIYSN 

AKNTEALPPPTSLDTVPSRPEL 


438 


1788 


A 


3563 


130 


1 527 


IFFNSSSLFCRVFCUlRWSFTLVAgAKv^i-v- 
NLSSLQPLPPGFK*FSCLSPPRS*DYRRPPPRPA 
NFLYF**RQGFTVLGQAGLELLT/S/GDPPTSA 
SQSAGITGVSHRAWPVHAISTHISLVKTRPSLT 

u5pAMRKSFGLSDCLWAWlLLLSll.TCKl>y ~ 


439 


1789 


A 


3565 


446 


1834 


GOPSLQDELKDNTTVFTRILDRLLDGYDNRL 

RTOLGERVTEVKTDIFVTSFGPVSDHDMEYTI 

DVFFRQSWKDERLKFKGPMTVLRLNNLMAS 

KIWTPEOTFHNGKKSVAHNMTMPNKLLRTTE 

DGTLLYTMRLTVR\AECPMAFGRDFPMM3\AH 

ACPLKFGSYAYTRAEVVYEWTREPARSVVV 

AEDGSRLNQYDlXGQTVDSGWQSSTGErA^ 

MTraFHLKRKJGYFVlQTYU>CIMTVILSQVS^^ 

WLl^SVPARTVFGVTTVLTMTTLSISARNSL 

PKVAYATAMDWFIAVCYAFVFSALIEFATVN 

YrnCRGYAWDGK3VWEKPKKVKX)PLIKra^^ 

NTYAPTATSYTPNLARGDPGLATIAKSATIEP 

KEVKPETKPPEPKKTFNSVSKIDRLSRIAFPLL 

FGTFNLVYWATYLNREPQLKAPTPHQ 


440 


1790 


A 


3568 


1 


350 


- "ITSSCFPAAAAAIMKHI VHLQAGQCGNgiCi/^ 
FWEVISDEHGIDPTGTYHGDSDLQLERINVYY 
NEATGEAPVPSPTALRGPRGPCLG*RPPVPAG 
OKYVPT? A VT.VDMEPGTMDSV 


441 


1791 


A 


3569 


2 


TT75i 


- FVAVAGAVSGKPLVHWCTQQLRK'l bijLV V S 
EEDQYVLSIESAEEmEYVTDLLQGNEGKKGQ 
Fffia.rrKWQKNDQELISDPLQQCFKKD]ED-DG 
QKSGDHLKRGRKKGFNRQEVPAFTEPDTTAE 

VKTPFDIJ^KAQENSNSVKKKT^^ 

ODRl^VLLPGRHPCDCLGQKHKLmNCLICG 

MVCEQEGSGPCLFCGTLVCTHEEQDILRGDS 

N«SQKLIiCKLMSGVENSGKVDISTKX)IXPH 

OELMKSGLEKA^^H^KLLEFDRTSIRRTQVI 
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Ycdictcd end > 
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:orresponding 1 
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^no acid sequence (A-Aianmc OK^steinc, 

>=Aspartic Acid, E^Glutamic Add, 

^^Phenylalanine, G=Glycine, H«Histldine, 

=lsoleucine, K=Lysine, Lr=Leucine, 

S4=4vIethiomne, N^Asparagine, P^Proline, 

5=0lutBmine, R^Argiune, S=Serine, 

r-Thrconine, V=Valinc, W=Tryptopban, 

Y=Tyrosine, X^Unknown. ♦^top codon, 

/_«««pc;ki«i TitirlpAtidp deleticMiL V^^DOSSiblc 
^^OSSiDJe nucicuiiuc uviwi.i*iii» 

niirlcflttde insertion ., „ 
















DDESDYFASDSNQWLSKUO^lETLQKRbbliLK 

ELRHASRI^KKVTIDFAGRKILEEl^SLAEYH 

SRLDEriQAIANGTLNQPLTKLDRSSEEPLGVL 

VNPNMYQSPPQWVDHTGAASQKKAFRSSGF 

GLEFNSFQHQLRIQDQEFQEGFDOGWCLSVH 

OPWASLLVRGKRVEGRSWYTPHRGRLWIAA 

TAKKPSPQEVSELQATVRLLRGKDVEFPNDY 

PSGCLLGCVDLIDCLSQKQFKEQFPDISQESDS 

PFWICKNPQEMVVKFPDCGNPKIWKLDSKIH 

OnAKKGLMKONKAV 




442 


1792 


A 


3576 


1 


2019 


■MPRSHTGERI.CEGKEGSQCAENFSPNLSV IK 
KTAGVKPYECTICGKAFMRLSSLTRHMRSHT 
AIRAI\EKPYKCKEC\GRAFSLSQILSKai^^ 
TGEKPYKCKQCGKTFIYHQPFQRHERTfflGEK 

pyeocqcgkalscssslrvherihtgekpyk; 

KOCGKAFSCSSSIRVHERTHTGEKPYACm: 
GKAFISNTTSXa-THNOTHNGDRPYKCKEOGKA 

fifpsflrvherihtgekpykckqcgkafrws 

TSIQIHERIHTGEKPYKCKECOKSFSARPi^y 

hvrvhtgekpykckecgkafsrisyfrihert 

HTGEKPYECKKCGKTFNYPLDlJUHJfwKiNn i u 

EKPYECKECAKTnSLENFRRHMnHTGDGPY 

T^rynT\nnv\rTmntQAi PTHERTHTGEKPYECKQ 

CGKAFSCSSYIRIHKRTHTGEKXPYECKECGK 

AFIYPTSFQGHMRMHTGEKPYKCKECGKAFS 

LHSSFR\RmTUHNYEKPLEC*Q\CGKAFSV^^ 

LKKPMRNAQSDRKLY/KCEK*EKVFNSm^^ 

QSCENSH*REKSCQCIC*YRKRDTR*FMYSQV 

PHNHVSVSNGPYR/CGSPIRLYNT^NISINRNL 

VAVVTP*CSTLFKCLWCWCKRAALSW*/IVQ 

DSGRGRWLTPVIPALWEAKAGGSRGQEIKTIL 

ANTVKPHLY 


443 


1793 


A 


3578 


287 


114 


"PFYERKFEQFIEGHKQlVNKWRDLLCSWKKfw 
T ciTiprcvrniMMT .♦FSAASMRFOKVFF 
HLFFSLFLAAMAMTGSTPCSSMSNKlKbRV 1 


444 


"1794 ' 


A 


" 3582 


"3335 


• 


MTKVTLENFYSNLIAQHEEREMRQKKLEKV 

MEEEGLKDEEKRLRRSAHARKETEFLRJUm 

RLGLEDFESLKVIGRGAFGEVRLVQKKDTGH 

VYAMKILRKADMLEKEQVGHIRAERDILVEA 

DSLWVVKMFYSFQDKLKLYLIMEFLPGGDM 

MTLLMKKDTLTEEETQFYIAEIVLAIDSIHQL 

GFIHRDIKPDNLLLDSKGHVKLSDFGLCTGLK 

KAHRTEFYRNUmSLPSDFTFQNMNSKRKAE 

TWmmRQLAFSTVGTPDYlAPEVFMQTGTO 

KLCDWWSLGVIMYEMLIGYPPFCSETPQETY 

KKVMNWKETLTFPPEVPISEKAKDLILRFCCE 

WEHRIGAPGVEEIKSNSFFEGVDWEHIRERPA 

AISIEKSIDDTSNFDEFPESDILKPTVATSNHPE 

TDYKl^WVFINYTYKRFEGLTARGAIPSYM 

ICAAKt 

- -RTRGlEKRFAYSFLQQtlK Y VDEAHQYlLbhU 


445 


~ 1795 


""A" 


" 3584 






GGSRGKGEHFPYEQEKFFAKWLPLIDQYFK 

NHRLYFLSAASRPLCSGGHASNKEKEMVTSL 

FCKLOVLVRHRISLFONDATSIVNCLHILGQT 

LDARTVMKTGLESVKSALRAFLDNAAmE 

KTMENIXQGQFTHTRNQPKGVTQIINYTTVA 

LLPMLSSLFEfflGQHQFGEDLILEDVQVSCYRI 

LTSLYALGTSKSIYVERQRSALGECLAAFAGA 

FPVAFLETHU)KHNrySIYNrKSSRKtAALSLP 

TNVEPVCPNIPSLEKLMEEIVELAESGIRYTQ 
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Amino arid sequence (A=Aiaiiiiie C=uysteui^ 
t>=Aspartic Acid, B=Glutaniic Add, 
F=PhenylalBnine, G=Glycins, IWBstidine, 
]=Isoleucine, K-Lysine, LHLcudne, 
M=Methionine,N=Asparaginc P=Pl0lin^ 
Q=Glutamine, R=Arginiiie, ^erin^ 
T=Threonine, V=Valine, W=^typtophan, 
Y=Tyrosine, X=Unlcnown, *=Stop codon, 
^ssible nucleotide deletion, \-possible 
nucleotide inseriion 



nucleoti de insertion ^„ 

WHVNEVlLPMIXSYMSRWWliHCil'bl'll'W'iiK 

AEMa:TALNSEHM^m.l^^^LIaIyN^xsa)E 

GAWMKRLAVFSQPIINKVKPQLLKTHFLPLM 
EKLKKKAATWSBEDHLKAEARGDMSEAEL 
Ln-DEFTTLARDLYAFYPLURFODYNRAICWL 
KEPNPEAEELRRMVAEWrfWSKSHNFKKK 
^,^]pwn^IP.INNMSFLr^DTKSKMSKAAVSDQ 



ERKKMKRKGDRYSMQTSUVAAUCKLl^iu^ 
NICAPGDQELIAI^NRFSLKDraDEVRDTOS 
NWLQGKLEDPAIRWQMALYKDLPNKnajre 
nPEidvERVLDlANVLFHLEQKSKRVGRRHY 



CLVEHPQRSKKAVWHKLLSKQRKKAVV^r 
RMAPLYNLPMIRAVhlLFLQOYEKSWffirE^ 
YFEDKUEDLAKPGAEPPEEDEOTKRVDPLHQ 
,.,Tp<:oTAT.TEKCKLEEDaYMAYADIMAKS 



CHDEEDDDGEEEVKSFEEKEMEKVjJU^xw 
ARLHDRGAAEMVLQTBASKGETGPJ^MT 
^EGIAILNOGNSIVQQKMLDYL^m)VGF 
Tj/vji AOi MOSCSVLDLNAFERQNKAEGLGM 



NSDFQhm.RTQTGNKITVNraSTVDYLLRVQ 
ESISDFYWYYSGKDVIDEQGQRNFSKAIQVA 
KQVFmTEYIQOPCTONQQSI^SRLWDAV 
VGHJIWAHMQMKLSQDSSQIELLKELMDLQ 
KDMVVhftt.LSMLBQNN^GTIGKQNm3MLV 




QGPRMAFFSILTWSALFALRYWILTLNUU^^^^ 
LKSUOCQMKKVKKMTVKDMVTAFFSSYWSI 
FWrrLLHFVASVFRGFFRnCSLLLGGSLVEGA 
l%^^^7v, I ANMPnFrODEVRGDGEEOERKP 



LEAALPSEDLTOUCELTEESDLLSU^OL^*^ 
EGGQYiaiPHNPNAGLSDLMS>n^MPEVQE 
KFQEQKAKEEEKEEKEETKSEPEKAEOroGE 
KEKAKEDKGKQKLRQLHTHRYGEPEVPKA 
FWKKIIAYQQKLLNYFARNFY^n^R^^AIJV 
Arl^pniiVKVSTSSWEGKELPTRSSSENA 



KVTSLDSSSHRIIAVHYVLEESSGYMEf 1 VJUi. 
SLm^ISFFCnGYYOXVPLVIFKBEia^Ai^ 
LEFDGLYITEQPSEDDKGQWDRLVTOTQSFP 
i^nramiTFVKRKVMDKYGEFYORDRISEUXS 



Y>KSEIX3DTPDMKCDDMLTCYh4FromGV 

RAGGGIGDEIEDPAGDEYEIYRnFDrtnWW 

VlLLAnQGLnDAFGELRDQQEQym>NffiTKC 

FICGIGNDYFDTVPHOFETHTLQEHNLAim-F 

m^INKDETBmXKJESYVWCMYQERCWE 

piTPAnn CFRKOYEDOm I- 

AGLELLNSDUPPALASQSAGlTUVlRlPiUI.' 

DTVIJXCSGWSAVAPSRLTAAU^*AQAV<X 

SLPRSWDYRRW/PPHPANFCIFCRDE/SLAyML 

PRLVSNSWTQAILLPKPPKMLOLQV _ 
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Sino acid sequence (A«Aianmc C=Cystcmc. 
>=Aspartic Add, EOlutamic Acid, 
^Phenylalanine, G=<jlycine, H=Histidinc, 
=Isoleucihe,K«Lys!ne,L=Leucine, 
^^Metiuonine, N^Aspaiagine, P=Prolme. 
WJlutaminc^ RpArgiune, S«Serinc, 
D=Threonine. V=Valinc W-Tryptophan. 
y^=Tyit>sine; X^Unknown, ♦=Stop codon, 
r«possible nucleotide deletion, \=possible 
nvcleotide insertion _| 


447 


1797 J 


K : 


)S98 


1202 


1070 


LFVGGGPlCPEGASOKAPUPAl'APRVoVlJAtV 

QR«V»GAAASQaAA3SLRPMTGPGHPGAWL 

OVWOAAAVCAQPAM*/AVRAKROPRAG*EP 

NSPWRSOVLAA\RAVGAGPWP*P*PGCS*ARG 

PSSRSAPOLASGPAAPLLQOVHSSAOPLLCYI 

HOn ATr,T.KP»»AWGWGEWRPKG 


448 


1798 


A 


3604 


3115 


557 


FRRKaOGGPKDFOAGLK.YNSRHEKVNULfct i 

GVEFU'VNNVKKVEKHOPORWVVI^^aJ^^ 

UXVUUSlOn-VWHLQYRDVRVQKVFNGYM 

RITNENFVDAYENSNSTEFVSLASKVKDALKL 

LYSGVm-GPYHKESAVTAFSEGSVlA 

FSffOHLVEEAERVMAEERVVMLPPRARSLKS 

FWTSWAFPTDSKTVQRTQDNSCSFOLHAR 

OVEIAIWTTPGPPDSPYPAHARCQWAI^OT 

ADSVl^TWSFDIASCDERGRHLV\TVY>mL 

SPMEPHAVLVQLCX3TYPPSYNLTFHS\S\QNVL 

LmmnERRHPG^^ATFPQLPRMSSCGGRL 

WoOTTOSPYYPGHYPPNIDCTWhnEXaW 

^^^^TOTKFFnXEPGVPAOTCPKDYVEING 

^CGERSQFVVTSNSNKITVRFHSDQSYTDT 

OTJiEYLSYDSSDPCPGQFTCRTGRCIRKELR 

aWWADCTDHSDELNCSCDAGHQFTCKNKP 

CKPLFWVCDSLNDCGDNSDEQGCSCPVAQTF 

RCSNGKCLSKSQQCNGKDTCGDGSDEASCP 

KVNWTCIKHTYRCLNGLXXSKGNPECDGK 

EDCSDOSDEKIXDCGiaSFTRQARWOGTO 

ADEGEWPWQVSLHALGQGHICGASLISPNWL 

VSAAHCYIDDRGFRYSDFrQWTAFU3LHDQS 

QRSAPGVQERRLKRHSHPFI^FTFDmALL 

ELEKPAEYSSMVBPICLPDASHVFPAGKAIWV 

TGWGHTQYGGTGALILQKGEIRVINQTTCEN 

LLPQQrrPRMMCVOFLSGGVDSCQGDSOGPL 

SSVEADGRIFQAGWSWGDGCAQRNKPGVY 

TRT.PLFRDWIKENTGV 


• 449 


1799 


A 


3618 


2 


613 


jrVS^WRMDOSTERLEARRPAGRU-WSSKy | 
EKnTUU-SU^GRQHGWSAQQ^WAOTVPG 
ANPDLU'HFLGEPEDVYIVKWa'VLLVCKAV 
PATQIFFKCNQEWVRQVDHVERSTDGSSGIP 
TMEVR]NVSRQQVEKVI'GLEEYWCQCVAWS 
SSCnTKSQKAYIRIAYLRKNFEQH>LAKEVSL 
pOftjvi.prRPPEGIPPAE J 


450 


1800 


A 


3620 


1 


" 2676 


MEPSLGQGMDLTCPFUVtiPACOAQASWSlbO 

ADAAEVPGTRQHSQQEAAMPHIPEDEEPPGE 

PQAAQSPAGQQGPPTAGVSCSPIPnVLTGDA 

•rePEGETDKNLANRVHSPHKRLSHRHLKVST 

ASLTSVDPAGHIIDLVNDQLPDISISEEDKKia^ 

LALLEEAKLVSERFLTRRGRKSRSSP(^SPSA 

VSPNLSPSASPTSSRSNSLTVPTPPEODEA0VS 

SPHPOEP>m'KGI^RKQNDQRKVSQGW;AP 

RPPPVEKSKEIAIEQKENFDPLQYPETTPKGLA 

pvtnssgkmalna>qpgpveselgkqluctg 
wegsplprsptqdaagvgppasqgrgpag^ 

MGPEAGSKAELPPTVSRPPLLRGLSWDSQPEE 

poprlqkvlaklplaeeekrfaokagokl^ 

APGLKDFQIQVQPVRMQKLTKLREEfflLMRN 

onlvglklpdlseaaeqekolpselspa^ 

ESKSGLDVMPNISDVLLRKLRVHRSLP^P 
LTEKEVENVFVQLSSAFRNDSYTLESWNQ^ 
BERm,raENTEKELENFKASn«SASL1»raKE 
IWKrYOKi.T.EDUVLHRLAARLSSRAEVVQA 
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Amino acid sequence (A-Alanmc c-^y.— , 
D=Aspartic Acid, EXilutamic Acid, 
F=Phenylalanine, G=Glycine, H=Histidme, 
I=Isoleucine,K=Lysine,L^Leucine, 
M=Methiomne, N-Asparagine, P^Prolme, 
OOlutaminCj RpArginine, S=Scrinc> 
T^Thrconine, V=Valine. W^^iyptophan. 
Y=Tyrosine. X=Unlmown. *==Stop codon. 
/brpossible nucleotide deletion, \=possiblc 
nucleotide insertion^ 



1802 



453 



1803 



455 



456 



457 



TSoT 



1805 



TSoT 



3623 



3628 



504 




AELMEFKKLANQNSSRSCGP^^^^ 

MSLTLOK>JMPRWlVSVAyWKFN^ 

TPSSSSIPSLPAl^ESPNGKOSLPVTSALPA^^ 

NGKTOGDPDCEASAPALTI^C^^ 

SEEAYSKGFQEGLKKimXJDLKEE^ 

S^PEEVEETCEEETOPRSSKLEELVHFL 

SSS^LGRSTCSAAQro^^ 
WSSGLQHEQFTEQ . 




195 



3637 



364r 



662 



McJuHTGEKPYECKVCGKy^ySS^^^ 
Rffl^GEmQCKELKORGAEMIAVLAVKEQ 

mTPVNYGK — 



VCIAVOCOHLEALNEGTMG 

SoOOWPEASPQQCODFSOroWSOTOI 



[414 



• Vn^/HPAML^LbELUKSGC GHCT^jAU^u^^'^ 

/^Spaossvtfedvavnfsleewsixneaq 

C^r, nimJMSCOVSSOPPRVAGLGLPLKHEPS 



SRVRAPSYDDTT — 

^^^^^^^ 
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seq- 
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SEQ 
ID NO: 
in 

USSN 
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Predicted 
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nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 



Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 



Amino acid sequence (A-Alanine CK^ysteme. 
D=Aspartic Acid, E«Glutamic Acid. 
F=PhenyIalanine, G=Glycine, R=Histidine, 
I=Isoleucinc K«Lysine, l^Uucine, 
M=Melhionine, N=Asparagine, P==Prolme, 
Q=GlutBminc, R^Arginine, S^Serine, 
T-Thrconine, V=Valine, W»Tiyptophan, 
y=Tyrosine, X=Unknown, ♦=Stop codon, 
/^possible nucleotide deletion. \=possible 

nucleotide insertion 

•pVPDYTSlHlVQSPQGLaNATALPLFUJUiFLS 

scgyvs tdqlnkimp 

Trapasgrsgaglalsanapusgghpgaikg 

PAGSLAHASGSARGTWRVRGRGSHGWERTV 

gaggcanpvpalhscasaprgtgrvsalgpk 
tcss plsspkg 

r OKYNTSMALFDFVLHNS'i GEIRYITEUUVig 

SQNALGKYNTSMALFESNSFEKriLESPYYVD 

lX0TLFVQVSUn-SDPNLVVFLDTC3lASPTSD 

FASnYDLlKSGCSRDETCKWYPLFGHYGRF 

OFNAFKFLRSMSSVYLQCKVLICDSSDHQSRC 

^QGCVSRSKRDISSYKWKTDSUGPIRiiCRDR 

SAO^GNSGFQHETHAEETPNQPFNSVHLFSFM 

NY 



458 



1808 



1809 



460 



1810 



461 



ISU 



3663 



154 



3670 



3671 



850 



557 



2472 



2099 



m 1812 



463 



1813 



464 1814 



3672 



394 



110 



A 3673 348 



3676 



3679 



2253 



803 



LGILMSPQVKAUE l*ALLTPFPOCMQl-SPUlJF 
K*WVSPGLTP/PPPEVPSVFLVEPGLPHAGQA 
GUDLLYrSGDPPASTSQSARTTDVSHRAQPLAI 

.S 



IGVLAFETGSCSVTRLYaGllMPHCSLDLAUSX 
TSAFRIAGTTSVHHHPQLTTFFFWIETGSHCV 
V0TGL»LLALSNPPALASQ1AGISGMSHRAWP 
ni VI v<;i.KFSLLCASO$LlMLFrCYNE 



VKPVNGESKRD*GADTQTCEQEADEgLg;l\N 
CYYD/STKSFFYISCG*K\RKPTWAENRRLNA 
KMFGIPLHSNSDPWGYEEREVIGFHRSRVSRG 

HGS 



QRPmGSQSELLAQPRUU'GRKSSFSmBV 
WPGQKPRPSQQQHQMCASPTLGQRSPFALEP 
VPAYHGGRDPFASARPSPVGIPKPRAAPAGG 

GWRRIRPKSSTK 



PVIQRCSQPYGbSLUSFFLKCVSbi SQgPVSR 
KVFQLLPSFPTLTRSKSHESQLGNRIDDVSSM 
RFDLSHGSPQhmRDIGLSVTHRFSTKSWL^^ 
OVCHVCQKSMIFGVKCKHI^UJKlCHNKCnCE 

Wrisfu>ltrlrrteswsdinnpvdra^ 

PHFGTL?Kj\LTKKEHPPAMNHLDSSSNPSSTT 

FSTPSSPAPFPTSSNPSSATrPPvNPSP\GQR\DSR 

FNFPSC/AYFIHHR\Q\QFIFPDISAFAHAA^ 

AADGTRLDDQPKADVLEAHEAEAEEPEAGK 

SEAEDDEDEVDDLPSSRRPWRGPISRKASQTS 

VYLOEWDIPFEQVELGEPIGQGRWGRVHRGR 

whgevairllemdghnqdhlku^ken^ 

YRQTRHENVVLFMGACMNPPHLAnXSF^G 

rtlhsfvrdpktsldinktoqiaqeik^ 

LHAKGIVHKDLKSIWVFYDNGVKNnaT^ 

xgisgwpvegrrenqlklshdwlcylapeivr 

EMTPGKDEDQU>FSKAADVYAFGTVWmQ 

Xrdwplknqaaeasiwqigsgegj^vi^ 

VSLGKEVSENLSACWAFDLQERPSVFSLLN© 
Na£KLPKLNRRLSHPGHF*KSADINSSKVVPR 

ferfg lgvlessnpkm 

IPSPAWWNSI'WADTFSLLLALAV. 

wacvlqthrafcasntedi^tvvnhi^^ 
oapllavgisfggilvlnhlaqarqaaolva 
altlsacwdsfettrsletplnsllfn^ta 

rw rOLVERLS Y/E*DLQARTIRQFDERYTSVA 
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Amino acid sequence (A«Alanine O^^ 
D=Aspartic Acid, &=Glutaimc AciJ 
F=Phenylalaniae, G^jlycine. HF-Histidme, 
I«Isoleudne,K=Lysine,LH^chic, 
M==Mcthiomne,N=Asparaginc P^Pioimp. 
(ioiutamme. R^Arginine. S=Serine» 
T=Thrcoiiine, V^Valine, W=^ryptophaii. 
Y==Tyrosme, X=Unkiiown, »=Stop codon, 
yt^ossible nucleotide deletion, \=possible 
nucleotide insertion 



DDPFSTVCAU»KQAAQHSPYVAU,rrARGGffl 

OLPDLRALLPSEDRNS 

lQYr/QSKTMhL*AAREKQyRHlCRRbSllUJ> 




KM^YALVSAWHSSOSYUJSEGLIfflQ 
^Cqi , vrHCAAPFEBQOEAERHVLRLQF 



APOSSAGSPGEASOLILAPOPAPUEFPLAAEVO 



HHLESVINTACFaWTRLL''GSGLDH»MSLFL 
TnrrMS»OSAAPAW0QEPPSIWHSH*PIQDSR 



LGPWVSHPLMRWVQKLSHLHSNPGTGFSMG 
GKQQRN 




?SSSvWra^HVG(7rGFELLTSSDLAPLA 
^S^MIVDEISGKIPHYESEIDENTFF 




SvUffGLPSHLSFmAAHNIPE^^ 
SSSNfl.TVKlJGIACATONAM.L 

ScLPLFPKEKSILGSMlJSWrrL^^ 
nrOVWDVSQPSPVTLQIDFPATOWEYMKPD 

l^c-^ rRPiKECIKHIARLSQKQT^ 



EKKRVLWFYRFYC^mENCSLPLVU.^«r«^ 
DOEmKVAVQQLDNLLNDELLEYLPQLVQAV 

L^AaCAmSWQKLLAALQFCAGKALN 
i;™^^^^^i^Kn.GDIGERVRSASDHQRQEVL 
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SEQID b 
KO:of > 
unci- P 
eotide s 
seq- V 
uence 


EQID K 
10: of h 
eptide 
eq- 
lence 


let "S 
od C 
ii 
I 

C 
S 


EQ F 
DNO: b 
1 n 
ISSN 1 
9/496 c 
H4 r 
e 
I 
1 


redicted | P 
eginning n 
ucleotide 1( 
ocation 1 c 
x)rrespondi t 
igto first 1 a 
iminoacid c 
esiducof s 
peptide 1 
sequence 


redicted end 1 / 
ucleotide 1 
jcation 1 ^ 
orresponding 1 
0 last amino i 
cid residue 1 
>f peptide 
equence 1 


Ciiino acid sequence (A^Alaniae C=Cysteme, 
>=Aspartic Acid, EKiliitamic Acid, 
?=Pheaylatanme, G=Gfycine, H=Histidiiie, 
«Isoleu(ane, K=Lysine, l/^Leucine, 
^-Methionine, N-Asparagine, Mroline, 
3=<}lutanune, R^Argfaine, S=6crine, 
r=Threonine, V=Valine. W=^iyptophan. 
ir=Tyiosine, X=Unk3iown, ♦=Stop codon, 
Nyossible nucleotide deletion, V=possible 
aucleotide insertion 














GDDLRQDMLVLQUQVMDNlWLQbULUMQ 

MnYRCLSTGKDQRLVQMVPDAVTLAKU^ 

SGLIGPUCENTIKKWFSQHNHLKj\DYEKALR 

NFFYSCAGWCVVTFILGVCDRHm>nMLT^^ 

GHMFHIDFGmGHAQTFGGIKRDRAPFIFTS 

EMNEYFTTEGGVKNPQHFQDFVMELCCRAYim 

KHSOLLL\NLL\EMMLYAGVLPELSGI\QDLKY 

VYNNIJa^ODTDLEATSHFTK^ 

UQNUHTUVQMSAISPAKSTSQTFPQESCLLST 

reSIERAmGFSKKJSSNLYLIQVTHSNN£SL 

•mKSFEQFSKLHSQLQKQFASLTLPEFPHWW 

HIi»Fmffl)HRRFRDLNHYMEQILNVSHEVTN 

SDCVLSFFLSEAGQQTVEESSPVYLGEKFPDK 

KPKVOLVISYEDVKLTILVKHMKNIHLPDGSA 

reAH^^FYLUWSEVRRRKTKSWKOT 

NEIWYDEVTELQGHVLMLIVKSKTVFVGAI 

MRLCSVPLDKEKWYPLGNSn*PLLLFSSFGM 

Yi^l PKDEFVGGMLLSNPIW 


470 


1820 


A 


3718 


430 


75 


-gHGSISILNLHQGCVPLPSLPAQGLRCYRCLA 
VLEOASCSVVSCPFLDGVCVSQKVSV/CWQ*/ 
CPWGARAEGRLSAWDSQISCCKGDLCNAV 
VT A AfiSPWALCVOLLLSLGSVFLWALL 


■rf 1 


1821 


A 


3723 


891 


494 


lilQSl^SVPQAGVQWRDSSLQAPPFK>^^^^^ 
CLSLPSSWDYRRLPPCLANFLYF* *RRGFrML 
ARMVLIS*PRDPPASASQ\STCrrGGSHRAQffi> 
TDSRDHSERSVKKSHEVISELRMKVIKCKVAF 

hCTlipOTCVSKDTSKiXSyi^ 


472 


1822 


A 


3734 


443 


251 


.|c.pyr,i v«PTrf>F.T.T.RHLDAEOVSSTAGLSL 

HrUsggarsgagwagrovkagieaorogif 


473 


1823 


A 


3746 


3 


500 


LTLSILR1RBLPSGAMSEGVDLIDIYADEEFNQ 

dpefnntdqidlyddvltatsqpsddrssste 
ppppvroepspkpnnktpailytysglrnrra 
avyvgsfswwttoqquqvirsigvydvgev 

l^AP>JRAK 


474 


1824 


A 


3753 


2 


1 5262 


l-^bAREGGlYAVLVCMQEYKTSVXLV^yAU 
UVALKJvlUVVASSSEIPTFVTGRDSIHSUTDAQ 
MTRHFASIDSATRPGSESLLLTVPAAVILMLN 
TCGCSSAARNGLLIX>a.LLX:bmmXGDQirrQ 
ELRDn-FRHSGlAPRTEPMPmTILMMLLNR 
YSEPPGSBERAALETPIIQGQDGSPELLIRSLV 
GGPSAELLLDLERVLCREGSPGGAVRPLLKRL 
OOETOPFLLLLRTLDAPGPNKTLLLSVLRVIT 
WXDFPEAMVLPWHEVLEPCLNCLSGPSSDSE 
IVOELTCFLHRLASMHKDYAVVLCaGAKEI 
LSKVLDKHSAQLLLGCELRDLVTECEKYAQL 
YSNLTSSILAGCIQMVLGQIEDHRRTHQPINIP 
FFDVFLRHLCQGSSVEVKEDKCWEKVEVSSN 
T>m?ACTirT TnHMPJCTYWESNGSTGSHYTTLHM 
HRGVLVRQLTLLVASEDSSYMPARVyVK^ 
DSTSaGTELimWMPSASRVEXENLNRFW 
pnomiKRCQQGGIDTRVROVEVLGPKPTFWP 
LFREQLCRRTaFYTIRAQAWSRDIAEDHRRL 
LQLCPKLmVLRHEQNFADRn.PDDEAAQAL 
GKTCWEALVSPLVQNTTSPDAEGVSALGWLL 
DOYLEQRETSRNPLSRAASFASRVRRLCHLL 
VHVEPPPGPSPEPSTRPFSKNSKGRDRSPAPSP 
VLPSSSLRNTTQCWLSWQEQVSRFLAAAWR 
APDFVPRYCKLYEHLQRAGSELFGPRAAFML 
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SEQID SI 
NO: of N 
nucl- P< 
eotide se 
seq- u 
uence 


IQ \D 1 Met" 
0: of 1 hod 
^tide 

jq- 

snce 


""SEQ r?i 
ID NO: h 
in 

USSN Ic 
09/496 0 
914 n 
a 

r 

F 
s 


ledicted 1 P 
Bgtnning n 
ucleotide h 
ication <^ 
orrespondi it 
gtofirst a 
mino acid c 
csidue of s 
«ptide 
cQuence 


redicted end" A 
ucleotidc E 

orresponding I- 
i last amino > 
old residue C 
fpeptide ' 
equence ^ 
/ 
1 


Sino acid sequence (A=Aianme u=^y5«u.^ 
NAspartic Acid, E=Glutamic Acid, 
-Phenylalanine. G=01ycinei H=Histidmft 
=Isoleucine. K=Lysine, L=Leucine, 
^-Methionine, N=Asparagine, P==Prohne, 
}=Glutamine, R=Arginine, S^Soine, 
r=Threonine, V^Valine, W=Tiyptophan, 
(^=Tyrosine, X«Unknown, *«=Stop codon, 
-possible nucleotide deletion, >r=possible 

l^ir.lftntide insertion rrrrrr, — 

^LRSGFSGALLQQSFLTAAHMSEQ^AKiODQ 












MQGGLIGGAPGVEMLGQLQRHLEPn^SG 
LELATTFEHFYQHYMADiaXSFGSSWLEGAV 

LEQIGLCFPNIUJ>QLMLQSUTSEBL^Q™;F 

QLQRLDKLFLEQEDEEEKRL*EEEEE^^ 

EffiLFIEDPSPAISILVLSPRCWPVSPLCYLYHP 

RKCLPTEFCDALDRFSSFYSQSQNHPVLDMG 

PHmQWTWLGRAELQFGKQILHVSTVQl^ 

lllkfnqteevsvetixkdsdlspelllq^v 

PLTSGNGPLTLHEGQDFPHGOVLRLHEPOPQ 

rsgealwuppqaylnvekdegrtleqk^ 

I^ClXVRlLKAHGEKGLHID^Va^^^ 
KGPNPPGTLGHTVAGGVACTSTDVLSCILffi-L 

MSPQEVEGLMKQTVRQVQm>ltEroVA»I 

GYYDGMSVEAQSKHLAKIiSKRCPSCQi^rE 
roY^NCSAMVSKAARQEmQDTOMH 

cqSarkvlayacvysfysqdaeymdvv 

FRVGLQSPSVEAWEAKGPNMPGSQPQA^ 
i^^EEE^DEDDVPEWQQDEroEELDNPSFS 

YpF?FNi-r>r)15TFFFGDEEEDEDEATO 


475 


1825 A 


3754 


1093 


96 


ATORBBWimQQRTAAIXTDOTrCTG^ 
SRM«lX:WPSRTrGAPGAK*A\a-WSATPTTO 
J^^J^GKLRAPGNRAG/SEPSSQH'PP 
^OTASITGVAQSPA-reATPSLPCLHVPAP 
Kn^TTGRASRLTVDRSRLSWORSA 
SGRWNAPRGRWPRAP'SWGSVm 
S^AAESPPHRCIYCTNHVSPAGPARPS 
kvSTWaSHPLCRAQSSPWEAAGVWRR 
55!^i^vWRKreVKRHDUYQFLGN 

Ti WRRnRORPPETLQPAR . 
- ^®}gVSPCPQAGV*WHUU)SLQNLPPOrk 


476 


1826 J 


V 3758 


901 


521 


^^S1JSSW^YRHVPPRQANFCIF/M»RRG 
SSSKipRDlJ>AlASQSAGITOV^^ 

- -gviSOiCNLRI-/CHLPUSSNSPASASQVA<^^ 


Hi / 


1827 


K 3761 


843 


" 575 


AR^ffVFLVETGFHHVSQDGLDLL/NFVl 
ppppPTVVT.QLOACTRARLPSPLKEL 


478 


1828 


A 3763 


267 


1240 


- HU^FHLWSASLUCLbgLiyliRHVKUMLLGP 
St^CTKPSPSPWKLTPPMCSIPFVFPP^GS 
fSsffSOHSKLEVERAQTGPFaMYCT'P 

Sto]^luwipfprl\sglvcfpah*^s^^ 

S^WsQAWUlQWEVSKHLQCP^L 
SHQPEDPAPOKAGTAEAVlPBmffiVLAG 

Spqdtdardadgeaeerep/rrpsfaa»p 

VWGQpWtPEASSAPPGPTLGTLPEVEl^ 

Sqelp^sprtrqpepdfycvkwwkge 

C^imOSTNGF «!W™l=WPLSSVEGEAPPA 
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SEQID 5 
NO: of 1 
nucl- 1 
eotide ! 
seq- 
uence 

1 


SEQID I 
NlO:of 1 
peptide 
seq- 
uence 


Viet ! 
lod ] 


3EQ 1 
[DNO: 1 
m 

USSN 
09/496 
914 


Predicted 

oeginning 

QUcleolide 

ocation 

corrcspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
Location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanme OCystcine, 
[>=Aspartic Add, EKJlutamic Acid, 
F=Phenylalanine, G=Glycine, H=€fistidine, 
l=Isoleucine, K^Lysine, L=Leucine. 
M^Methionine, N=Asparagine, P^Prolinc, 
Q=GIutaniine, R=AigiDtne, S=Serine, 
T=Thieonine, V=Valine, W=Tiyptophan, 
Y»Tyrosine, X=Unknown, '•^Stop oodon, 
/feasible nucleotide deletion, V=possiblc 
nucleotide insertion 
EGSDfflG 


479 


1829 


A 


3766 


2 


2152 


YSPIRLLEVCVPLPKIFIKRQAPLKVSLLQDLK. 

DFFQKVSQVYVAIDERLASLKTDTFSKTREEK 

MEDIFAQKEMEEGEFKNWIEKMQARLMSSS 

VDTPQQLQSVFESLIAKKQSLCEVLQAWNNR 

LQDLFQQEKGRKRPSVPPSPGRLRQGEESKIS 

AMDASPRNISPGLQNGEKEDRFLTTLSSQSST 

SSTHLQLPTPPEVMSEQSVGGPPELDTASSSE 

DVFDGHLLGSTDSQVKEKSTMKAIFANLLPG 

NSYNPIPFPFDPDKHYLMYEHERVPIAVCEKE 

PSSIIAFALSCKEYRNALEELSKATQWNSAEE 

GLPTNSTSDSRPKSSSPIRLPEMSGGQTNRTTE 

TEPQPTKKASGMLSFFRGTAGKSPDLSSQKRE 

TLRGADSAYYQVGQTGKEGTENQGVEPQDE 

VDGGDTQKKQLINPHVELQFSDANAKFYCRL 

YYAGEFHKMREVILDSSEEDFmSLSHSSPWQ 

ARGGKSGAAFYATEDDRFILKQMPRLEVQSF 

LDFAPHYFNYITNAVQQKRPTALAKILGWRI 

GYKNSQNNTEKKLDLLVMENLFYGRKMAQ 

VFDLKGSLRNRNVKTDTGKESCDWLLDENL 

LKMVRDNPLYIRSHSKAVLRTSIHSDSHFLSS 

HLnDYSLLVGRDDTSNELWGITOYIRTFTWD 

KKLEMVVKSTGILGGQG^MPTVVSPELYRTR 

FCEAMDNYFLMVPDHCTQLGLNC 


480 


1830 


A 


3777 


251 


3 


" QGCGSAGTLlHY**ECKMVQLLWKTVQl'l-l 
KLNI\KDPAITLDVYPNEVKNYVRTECTYTQNIF 
1/ANFIMAKSWKQPTHPSVRT 


481 


1831 


A 


3779 


333 


3 


EAAmQPEPNILDVNQIFKDLAMnHDQGDLlu 
SIEANAESSEVLVERAPOQLQRPAVYYQKKSR 
KKMCLWLVQTAIDLICERIM+WYTTKWSPPI 
VLPVSCFOGOKFN 


482 


1832 


A 


3780 


2 


371 


TGORQGKNDHTSrrEKPSRDFNRHUTQNl'M 
PNQDMKSSSNSLmVQimTLYHHIFTRKA 
KMKTTDKTKYR*GFK^UTrLmCSQIX»XJ*S 
/L*ENHFM1FPKABQHITYDTTIPFLR 


483 




A 


3787 


43 


448 


Li^LSPYVMETEIYILNRU^ERARSMWKttuu 

KLPNTKDQEKILKAIRGRREVIQGS/RQQYRR 

PAAFSAAEKARRLWCSA^FNIERRNL/CEYPTK 

LSFNIKGE^^^FSDKTEFTTNRPSLKMUJa)RI 

OEEGKMF*KEKCFKRKE 


484 


1834 


A 


3798 


1 


727 


" FFFFETESRSVAQAGVQWCmGSLQALPnjJ'X 
SHSPASASRVAGTTGTRH*ARLIFYIFSRDGVS 
PC* PGWS* SPDLVIRPrVRLrJvO WJJ i tsjsjarrssr 
A*FFWLVE\QGFrMLARMVSIS*PQ/CDLPAS 
VSQNAGITGVSHCAWPCLHFCFFGFFFEMESC 
SVAQAEVQWHDLRSLQAPPPGFIPFSCLSLPG 
SWDYRRPPPRPANF\CIFSRDGVSPC*PGWSRS 
PDLVIRPPRPPKVLOLQA 


483 


1833 


A 


3S02 


1 


239 


" TFFFEMECLTVSQAGVQWYNLHSLQPU'1'W 
KQFSaLSLPSSWD*RVPTSRPAKF/CVIF*DGV 

SHCOPGWSAWQPPLH 


486 

» 


1836 


A 


3811 


378 


98 


' "RYD*ssQSE^m^Qla^^^LKYP*c^A^x-GMK^l 

MSIMKKKSIFSAEFYKVSLPSLLLVHLLAIEWQ 
FHIEIOLTIHQHFLNYELESDFVHIVEYM 


487 


1837 


A 


3814 


771 


320 


'FDPDWTRAAGIRHEKJa>KAIj\YRREN 
PPPPLPPPEEEASWAL/GAEGSRQHVLPGAGA 
QWGEESGPGRAPGSPAGAPPR*RGLAPVNSRP 
SFLSRGQGTSTCSTAGSNSSROSSSSROSRGPG 
RSRSRSQSRSQSORPGQKRREEPR 
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I seqid" 

I NO: of 
I nucl- 
I eotlde 
i seq- 
uence 



SEQID 
NO: of 
peptide 
seq- 
uence 



Met 
hod 



1838 



[489 



1839 



SEQ 
ID NO: 
in 

USSN 
09/496 
914 



Predicted 



3818 



nucleotide 
location 
correspondi 
ng to first 
amino acid 
residue of 
peptide 
sequence 



1840 



3822 



3825 



Predicted easi 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 



934 



781 



669 



79 



97?r 



Amino acia sequence (A-Maame C=Cysielne, 
I>AsparticAcid,&01utamicAci4, 
F^Phenylalanine, G=Glycine, H^Histidme, 
I=Isoleudne,K=Lysine,LFLeucm^ 
M=Methionine, N=Asparagine, P=Prolmc, 
Q=Glutamine, R=Arginine, S=Serine, 
T=^eonine, V=Valinc, W=Tiyptophan, 
Y«=Tyrosine, X=Uniaiown, *^Stop codon, 
/possible nucleotide deletion, V=possiblc 
nucleotide insertion 



iaiL\FCLAGFSmGQVU=KGCDVKTTFVT 




S^M^GTIGACHHAQUFVFLVETOFHHVG 
^^^^^^ 




SSaprsli^wrfaei^vrpqk 

v^SoVTOKEMEVSPSAEQLVQVTOL 




Xgggsscspvlsrkqkgkvllgee^ds^ 

ESRSDVSSSALTASVKBEISGH^SSGVSWJ 
iAGHDnraQPRSQHTLQADSVDLASroi^S 

TftA^SPlSDSSOTTTEQPDSAVTPSDSSEIVLD 




m>«ICVW.LSASaLTGOKNVLVI^VKV 
^KSvGAAVALHPESFFSKLYKVPLD 

^T^^JSoWSDIUmDHGDPQ^^^ 
^K^DWMGmUTGNTFSL 



^WxQPPSHFSVSTITWYMTa^TO 

IJSENNLSRVIAAVSHELrrSTTRALWC^^^ 

ALCLLSTAFPVCIWSLG\VHCGVPPLSASDESIl 

STagnli^sapkslrsswas^anp^ 

?™n?;nCAPr,PAIKAALPSLTNPPSLSPIRBK_ 
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I NO: of 
nucl- 
eotide 
scq. 



SEQED 
NO: of 
peptide 
seq- 
uence 



Met 
hod 



beginning 
nucleotide 
location 
conespondi 
ng to first 
amino acid 
residue of 
peptide 
sequence 



rpFSIHted end I Ambo acid sequence (A-Alumne C^K^steine, 
nucleotide I>Aspaitic Acid, EpClutamic Acid 

F^Phenylalanine, (HJlycine, H=Histidme. 
corresponding I^Isoleudne, K=Lysine, L-Leucin^ 
toSino M=Methionine,N=Aspaxagine^P=Proline, 

acid residue <H>l«*^%^=?^rt 
of peptide T^Thiwnine, V=Valme, W=Tryptoptoi, 
seq^Tce Y-Tyiosine, X-Unknown, ♦^Stop codon. 
' /=possiblenuclcotidedelction,\=possibic 
nucleotide insertion 



491 



492 



1841 



1842 



A 



3836 



46r 
39r 



nucleotide i nsertion — 

rmTTCVQLQKQVLDLLAQLVQLRVNYULL 
DSDOVFIGFVLKQFEYIEVGQFRESEAIIPNIFF 
FLVlisYERYHSKQnGIPKIIQLCTCIMASGR 
KAVTHAIPALQPIVHDLFVLRGTNKADAGKE 
LErQKEWVSMIXRLIQYHQVLEMFIL\a.QQ 
CHm^DKWKJU.SRQlADIILPMLAKQQMHl 
DSHEALGVLNTLFEILAPSSLRPVDMLLRSMF 
VTPNTMASVSTVQLWISGILAILRVUSQSTED 
TVT cRTnFT.SFSPYLISCTVINRLRDGDSTSTLE 



MNSEFNLSLLAPCLSLGMSEISGGQKSALFEA 
^^VSGTVQQLPAYffilVFQPELPy^P 
AAWSKLNDUfGDAALYQSLPTLARALAQY 
LVWSKLPSHLHl^PEKEKDIVKFWATIXAL 
SWHLIHEQIPLSU)IX3AGLD0CX:LALQLPGL 

EQLLSPERRTrm>KAISEEEEEVDPNTQNPKYI 

taacemvaemveslqsvlalghkknsgvpa 

WSPKPGGDFGTAFPEXPVEFLQEKEVFKEHYR 
TNTtGWreRTQFEETWATLLGVLVTQPLVME 
OEESPPEEIKERTQINVLAVQAIX^LVLSM^ 
WVAQNPAVSCXEQQPW«3>LKALDTRFGRK 
I^mGIVEQEIQAMVSKRENIATHHLYQAWD 

YKLGQVSfflSVWLGNSITPLREEEWpEEEEEE 

ADAP^SSPPTSPVNSRKHRAGVDIHSCSQFL 

LELYSRWILPSSSARRTPAILISEWRSLLVVS 

DUHERNQFELMYVTLTEUIRYHPSEDEIL^^ 

YLWATC^AAAVLGMDKAVAEPVSRLLESTL 

RSSHLPSRVGALHGVLYVLECDLLDDTMQL 

IPVISDYLLSNLKGIAHCVNIHSQQHVLVMCA 

TAFYLIENYPLDVGPEFSASnQMCGVMLSGS 

EESTPSnYHCALRGLERLLLSEQLSRLDAm. 

VKLSVDRVNVHSPHRAMAALGLMLTCMYT 

GKEKVSPGRTSDPNPAAPDSESVIVAMERVS 

VLFDRIRKGFPCEARWAWLPQRDD^Q 

DIMNKVIGEFLSNQQPYPQFMATWYKVFCTr 

LHSTGQSSMVIU)WVMLSLSNFrQRAPVAm 

TWSLSCFFVSASTSPWVAAILPHVISRMGKLE 

OVDVNLFCLVATDFYRHQIEEELDRRAFQSV 

hl^mSRVAGITGVHQHAWLll' VFtVEMEF 
imvr^AVT.KLLTSGDLPVSASOSA 



rSNPPASASRVAUlluvnvn/^wLor i 
mwnO AVLKLLISGDLPVSASQ SA 
rVAPSPMIMPl)LYFYRDPEEilim;*AAA^^E 
' FOSEWTAW/P/EFTATQSEVADWFICDMQW 
SVPIQQFPTEDWST»PTMNDWSATSTAQXTE 

WVRITTBWP . 
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1 SEQID S 
NO: of 1^ 
nucl- P 
eotide a 
seq- u 
uence 


EQID IV 
0:of h 
eptide 
Bq- 
ence 


let S 
od H 
it 
I 
0 
9 


EQ P 
DNO: b 
1 n 
[SSN I 
9/496 c 

14 E 

a 
I 

1 

J 


redicted P 
eginning n 
ucleotide l< 
jcadon c 
orrespondi t 
ig to first a 
mino acid c 
■esidue of s 
)eptide 
sequence 


redicted end / 
ucleotide I 
acation ^ 
orrcsponding 1 
0 last amino 1 
Lcid residue < 
il pepuuc 
equence 


SSSTacid sequence (A=Alanine C=cysifime, 
)=Aspartic Acid, EpGlutamic Acid, 
^Phenylalanine, G*Glycine, H=Histidine, 
=lsoleucane, K=Lysine, LpLeucine, 
si=MclhiQninc, N«Asparagine, P"Prolinc» 
3=Giutamine, R'^Arginine, *=Serine> 
r=Threonine, V=VaIine, W'O'typtophan, 
y=n'yrosine, X»Unknown, ♦«Stop codon, 
f==possible nucleotide deletion, V^ssible 
nucleotide insertion — 1 


493 


1843 i 


\ : 


)838 


19 


m 


rPSDMNRAFETDTQSlGEKNRSPSHPmj'HH^ 
KFKRS*EKAHIRYKIDQPEDIPLK\EFLCKHSK 
CTATLSMRNMSimKKCSFSEEFUJVFF^ 
vrm.LAIKLGFYIEIHLTTFNNTF J 


494 


1844 




3845 


2 


352 


ITFLRRSL/DSVAQAEAQWL^UiaQAP^^^^^^ 
KPISUAOLPSSWDYGRPPPCPANFCIF/M*RRO 
FrVLARMVLIS*PCDPPTT-ASQGTAITGMSYH 

ARPQDTDFLYAHOGRCWFRLL 

DIpfeREGMGQDEAQFSVlM'Ll^^ 


495 


1845 


A 


3847 


1774 

./ 


40 


WADKYia»RKPRFFmVHTGFEWNKYNQTHY 

DFDNPPPKIVQGYKR^DUDJ^TP^ 

EACADNKDFAILRFHAGPFmJIAFKIVNREW 

EYSHRHGFRCQFANGIFQLWFHFKR^^ 

RPWGTAGRCPRGHSKOASVKLVVTPCTLSG^ 

0GRGFTSHLRPHLSFARPQFPP1*KGGHH*AC 

rrocT DUTTOmRT A*GPnATBOALGASFEHEG 
HGELRRHWUKIA 

CWQPPADLTVQADTLHBPSARLGOAHRACPK 

MtraRVLWRWARGAWAWRCQAREKQCTQQ 

QPCHrroHPLQREAEPAAAGAAPAIAHRPP^ 

ARTGSTE\POPCWRPIRHCRKDPLWTPTLC«D 

WPTHPVLAGGVHFPAAGflGGCVEVPVSVN 

VMGTKSH»AVLPPPPSTGPGGQGLPEGWGL£ 

KOEOLPPGffPPGIXTOPV^MRPVTPSFAHR 

TVAPSHSPFSQQEGRGPHGCHSPGR\SGPVAGR 

LVLOHPTGTSPTEAKKKVHPGPPEGHPTSPVT 

SPRFPtAITOHPASSGNSSVCFSKKTCRWEKK 

SPVIAIELAYWODRMFF 




1846 


A 


3849 


830 


442 


AkSPLPU3»iy VVK/NtObLKLRLPOl'K.-r icw 
LLSSWDYRSUPPRPVNFCILVELGFHHVDQAG 
UOLTSSALPALASQSAEriGMSHRIWPLPLLR 
pppvmiPAPPO'' 'TSLKALSPNMATF 


497 


1847 


A 


3859 


2 


393 


AUtKTR&iXjiAfeTGAQPAASWKGTONYl'WK 
LEMAGRPGSQEQSKDRGTGSLPPPSQRPLCTS 
PEOAGPSPPPPOIPRGGGSSSSEOP/PQU™ 
RRPAPKKQLPSDTPHSKAPPTPHLILGGEDSQ 

■ KNASmSSQGDPKSFFaURWSlALVAyAun 


49S 


1848 


A 


3860 


253 


634 


^lSa,QPPPPGFK*FSa^LPSSWD\mC^ 

LPCLANF\*FLVErGFHHVGQADLKLLTSODP 

PT^ASFSAnrrovSHRAWPRIHFLYWKTFFL 


[499 


1849 


A 


3863 


423 


263 


- APSQISVAFLYAA/DKlJEKlil^WUPHl^^^^ 
KlGWLTKEVKYLYTENYrrLMKEIKyDTDKW 

KDILY*mGKIW*mSTPPKAmro^™ 
MrFFrEIEKSIIKFIWNHmPKrQSNIEQKE'S 
FCSE.LWVFOGFLWFHMNFMIDFSISVKNVIGI 


500 


1850 


A 


3865 


2 


15246 


- iJRGCLWCLQRSFTPARPQPSKl'ARSPLrUfJ' 
DUffWASDLDMGDAEGEDEVQFUOTDEV 
\n nncATVX KPni KLCLAAEGFGNRLCFLEP 
TSKAQNVPPDLAICCFVLEQSLSVRALQEML 
ANTVEAQVESSQGGGHRTLLYGHAIIXRHAH 
SRMYLSCLTTSRSMTOKUFDVGLQEDATGE 

AWWTNflffASKQRSEG^ 

SERYLHLSTASGELQVDASFMQTLWNMNPIC 

SRCEEGFVTGGHVLRLFHGHMDECLTISPADS 

DDQRRLVYYEGGAVCTHARSLWRLEPLRIS 

WSGSHLRWGQPLRVRHVTTGQYLALTEDQG 

LimDASKJSmCATSFCFRISKEKL^ 
DVEOMGPPEBCYGESLCFVQHVASGLWLTYA 
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SEQID S 
NO: of > 
nucl- p 
eotide s 
scq- y 
uence 


EQK) ^ 
10: of h 
eptide 
eq- 
cncc 


let S 
od H 
ii 
I 
0 
9 


EQ IP 
3 NO: b 
1 n 
JSSN 1 
m96 c 
14 n 
e 
I 
I 


redicted i 
eginning n 
ucleotide 1 
[)cation € 
orrespondi t 
igtofiTSt £ 
iminoacid ( 
■esidue of i 
peptide 
sequence 


redicted end / 
ucleotide I 
ocation ^ 
.orrcsponding ] 
0 last amino 1 
icid residue j 
)f peptide 
sequence 


onino add sequence (A^Alanine OCystcme. 
>=Aspartic Acid, EFGlutamic Acid, 
^Phenylalanine, G=<jlycine, H=*Histidine, 
-Isoleucine, K=Lysine, LHLeucine> 
vl-Mcthionine, N=Asparaginc, P«Prolinc, 
5=Glutamine, R=Arglnine, S«Serine» 
r=Threonine. V^Valine, W-Tiyptophan, 
y=Tyrosinc, X=Unknown, ♦-Stop codon, 
/=possible nucleotide deletion, \*=possiblc 

nucleotide Insertion - 

APDPKALia.GVLKKKAMLHQEGHMDDALSL 

TRCOQEESQAARMIHSTNGLYNQFIKSLDSFS 

GKPRGSGPPAGTALPIEGVILSLQDLIIYFEPPS 

EDLQHEEKQSKLRSLRNRQSLFQBEGMLSMV 

LNCIDRLNVYTrAAHFAEFAGEEAAESWKEI 

VNLLYELLASLIRONRSNCALFSTNLDWLVS 

KLDRLEASSGlLEVLYCVLIESPEVLNnQENHI 

KSIISLLDKHGRNHKVLDVLCSLCVCNGVAV 

RSNQDLira^LPGRELlXQTmiN™!]^ 

IFVGRAEGTTQYSKWYFEVMVDEVTPFLTAQ 

ATHLRVGWALTEGYTPYPGAGEGWGGNGV 

GDDLYSYGFDGLHLWTCar/ARPVTSPGQHL 

lAPEDVISCCLDLSVPSlSFRINGCyVQGVFESF 

NLDGLFFPWSFSAGVKVRFLLGGBHGEFKF 

LPPPGYAPCHEAVLPRERLHLEPIKEYRREGP 

RGPHLVGPSRCLSHTDFVPCPVDTVQIVLPPH 

LERIREKLAENIHELWALTRIEQGWTYGPVRD 

DNKRLHPCLVDFHSLPEPERNYNLQMSGETL 

mUU^HVGMADEKAEDNLKKTKLPKTY 

MMSNGYKPAPLDLSHVRLTPAQTTLVDRL^ 

NGHNVWARDRVGQGWSYSAVQDIPARRNPR 

LVPYRLU^EATKRSNRDSLCQAVRTLLGYGY 

NIEPPDQEPSQVENQSRCDRVRIFRAEKSYTV 

OSGRWYFEFEAVTTGEMRVOWARPELRPDV 

ELGADELAYVFNGHRGQRWHLGSEPFGRPW 

OPGDVVGCMIDLTENTIIFTLNGEVLMSDSGS 

ETAFREIEIGDGFLPVCSLGPGQVGHLNLGQD 

VSSLRFFAICGLQEGFEPFAINMQRPVTTW 

KGLPOFEPVPLEHPHYEVSRVDGTVDTPPCLR 

LTHRTWGSQNSLVEMLFLRLSLPVQFHQHFR 

CTAGATPLAPPOLQPPAEDEARAAEPDPDYE 

NLRRSAGGWSEAENGKEGTAKEGAPGGTTQ 

AGGEAQPARAENEKDATTEKNKKRGFLFKA 

KKVANmTQPPATFnJiaPHDVW^>^^ 

PEin.NTrTYYYSVRVFAGQEPSCVWAGWVT 

PDYHOHDMSFDLSKVRWTVTMGDEQGNV 

HSSLKCSNCYMVWGGDFVSPGQQGRISHTDL 

VlGCLVDlATGLNmnrANGKESNTFFQVEPN 

TKLFPAVFVLPTHQNVIQFELGKQKNIMPLSA 

AMFQSERKNPAPQCPPRLEMQl^NffVSWSR 

MPNHFLQVETRRAGERLGWAVQCQEPLTMM 

ALHIPEEmCMDILELSERIJJLQRFHSHa^^ 

YRAVCALGNNRVAHALCSHVIKIAQLUIALE 

DAHLPGPLRAGYYDLLISHtESACRSRRSNO. 

SEYIVPLTPETRAriLFPPGRSTENGHPRHGLP 

GVGVTTSLRPPHHFSPPCFVAALPAAGAAEAP 

ARLSPAIPLEALRDKALRMLGEAVRDGGQI^ 

RDPVGASVEFQFVPVLKLVSTLLVMGIFGDE 

DVKOILKMIEPEVFTEEEEEEDEEEEGEEEDEE 

Z,,,^^r^r^T A r>T5iri7nT?PlfFFPRAAEGEKEEG 
EKEEDEEETAQEKJbUjiErKrJ^Js^rt'*^ 

LEEGLLQMKU'ESVKLQMCHLLEYFaJQaQ 

WveslIWaeryvdklqanqrsryoujm. 

FSMTAAETARRTREPRSPPQEQlNMtl^EKDO 
TOEEDCPLPEEIRQDLLDFHQDLLAHCGIQLp 

PEEERSAEESKPRSLQELVSHMWRWAQroF 

vospeCvramfsllhrqvdglgellralpra 

YTISPSSVEDTMSLLECLGQIRSLLIVQMGPQE 

. ENL^DQSIGNIMNNKVFyQHP^a>lRJ^®;4ffi 
TVMEWlVNVLGGtffiSKEIRFPKMVTSCCRa 
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■ " 1 ^ 4< . J — J 1 'i 


toino acid sequence (A=Aianine C=Cysteine, 
)=Aspartic Acid, &=Glutamic Acid, 
'=Phewlalanine. 0=Olycine, H=Histidliie. 
=Isoleucine. K=Lysine. L=Leucine, 
^t=Mettiionine,N=Asparagine, P=Proline, 
3=01utamine, R=Arginine, S=Serine, 
Mlreonine, V=Valine, W=Tryptophan, 
^-Tyrosine, X^Unlmown, •-Stop codon, 
fc^jossible nucleotide deletioiu \"T»ssible 
BBcleotide insertion ■ „ , ■ 


SEQBD S 
NO: of ^ 
nucl- P 
eotide s 
seq- u 
uence 


EQID N 
10: of h 
cptide 
eq- 
ence 


let S 
od B 
ii 
I 

C 
S 


EQ P 
DNO: b 
1 n 
JSSN U 
9/496 c 
H4 B 

8 

r 

{ 

s 


rcdiciea r 
eguming 1 n 
ucleotide 1 i 
}cation c 
orrespondi t 
gto first 1 & 
iminoacid < 
csidueof 1 2 
jcptide 
>equence | 


ucleotide 1 
^cation ^ 
orresponding 1 
0 last amino 1 
Lcid reddue < 
>f peptide ' 
icquence 

i 














CYFCRlSRQNQRSMFDHLSVLLENSbltjLOM 

OGSTPLDVAAASVTONNELALALQEQDLEKV 

VSYLAGCGLQSCPMLVAKGYPDIGWKPCGG 

ERYLDFLRFAVFVNGESVEENANWVRLUR 

KPECFGPALRGEGGSGLLAAIEEAIWSEDPAR 

D0PGIRRDIUWEHFGEEPPEENR^aaGHM^4S 

FYAALIDLLGRCAPEMHUQAGKGEALRreiU 

LRSLVPLEDLVOnSLPLQIFrLGKPGALVQPK 

MSASFVPDHKASMVLR0RVYGIENQDFLLH 

V11)VGFLPDMRAAASLDTATFSTTEMALAV 

NRYLCXAVLPLmCCAPLFAGTEHRAIMVDS 

MUnVYRI^RGRSLTKAQRDVIEDCLMSLCR 

YIRPSMLQHLUUU-VFDWILNEFAKMPUaL 

TNHYERCWKYYCLFTGWANFGVTSEEELHL 

TOKLFWGIFDSLAHKKYDPELYRMAMPCLC 

AIAGALPPDYVDASYSSKAEKKATVDAEQNF 

DPRPVEILNVIlPEKLDSFINKFAEYrHEKWAF 

DKIOI^mrSYGENIDEELKTHHMLRPYKTFSE 

KDKEIYRWPKESLKAMIAWEWTIEKAREGE 

EEKTEKKKTAKISQSAQTYDPREG\W 

SAVTLSRELQAMAEQLAENYHKrWGRKKKQ 

ELEAKGGGTHPLLVPYDTLTAKEKAKpmA 

0ELLKFLQMNGYAVTRGLKDMEU5SSSIEKR 

FAFGFLQQLLRWMDISQEFIAHLEAWSSGRV 

EKSPHEQEKFFAKILLPLINQYFTOHaWW 

TPAKVLGSGGHASNKEKEMimFCKLAALV 

RHRVSLFGTDAPAWNCLHILARSLDARTVM 

KSOPEIVKAGLRSFFESASEDIEKMVENLW-a 

KVSQARTQVKGVGQNLTYTTVALLmmF 

QHIAQHQFGDDVILDDVQVSCYRTLramLG 

TOCOTYVEKLRPALGEC3LARLAAAMPVAFLE 

PQLNEYNACSVYTTKSPRERAILGLR;ISV^ 

CTDIPVLERLMADIGGLAESGARYTEMPHVB 

rrLPMLCSYLPRWWERGPEAPPSALPAG/J'PP 

CTAVTSDHU^SLIXJNILRnVNNLGlDEASWM 

KRLAVFAQPIVSRARPELLQSHFIPTIGRLRKR 

AGKWSEEEQLALEAKAEAQEGELLVRDEFS 

vCamLYALYPLURYVDNNRAQWLTEPOTS 

AEELFRMVGBIFiyWSKSHNFKREEQNFVVQ 

NEINNMSFLTADNKSKMAKAGDIQSGGSDQE 

RTKKKRRGDRYSVQTSUVATLKKiyiLPIGLN 

MCWTOQDLm^mYALKDTDEEVREFLH 

NNLHLQGKVEGSPSLRWQMALYRGVPGRffi 

DADDPEKIVIWVQEVSAVLYYLDQTCHPY^ 

KKA\niraKli^QRRRAWACFRl^LYNLP 

^SACNMFLESYKAAWILTCDHSFEDI^roD 

iS^GEOEBEEEEVEEKKPDPLHQLVLHFSRT 

ALTEI^^DHiYLYMAyADIMAKSCHLEEG 

GENGEAEEEVEVSFEEKQMEKQRLLYQQARL 

GIsSatMEV(K)KMLDYLKDKKEVGFFQS 

lOALMOrcSVLDLNAFERQNKAEOLOMVNB 

DGTVINRQNQEKV>MDDEFTQDUTIFLQLLC 

EGHNNDFQhm-RTQTOTmriNUICIVDYLL 

RLQESISDFYWYYSGKDVIEEQGKRNF^ 

SVAKQVFNSLTEYIQGPCTGNQQSLAHSW.W 

DAWCTLHVFAHMMMKLAQDSSQIELLKEL 

LDLQKDMWMLLSLLEGNWNGl^QMV 

DMJ^SSNN^NnLKFFDMFLKLKDIVGSEi^J' 

ODYVTDPRG'ISKKDFOKAMDSQKQFSOPEI 
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SEQID S 
NO: of > 
nucl- p 
eotide s 
seq- 
uence 


EQID N 
10: of h 
eptide 
eq- 
ence 


let S 
od I] 
ii 
I 

C 
S 


eQ t 

DNO: b 
1 n 
JSSN 1 
9/496 c 
114 r 
e 
I 

1 

< 


tedicteci v 
cginning n 
ucleotide 1 
Dcation c 
mespondi t 
ig to first 2 
imino acid < 
-esidue of i 
3eptide 
sequence 


redicted end / 
ucleotide I 
Dcation 1 
orresponding 1 
0 last amino 1 
icid residue i 
)f peptide ' 
jcquence 

i 


Siimo acid sequence (A^Alanlne OCystelne, 
>Aspartic Acid, E=Glutaniic Acid, 
'=Pheaylalauine. G=Glycine, H=Histidine, 
=Isoleucine, K=Lysine, L=Leucin^ 
^=Metiiiomne,N=Asparagine, P=Proline, 
JMjIutainine, R=Arginine, S=Serine, 
r«Threonine, V=Valine, W=Tryptophan, 
y«Tyrosinc, X«Unknown, *=Stop codon, 
fc=possible nucleotide deletion, V=possible 
nucleotide insertion J . 














OFLLSCSEADENEMINCEHFANRFQBl'AKUio 

R^A\a.LTha.SEHWHDPRLHNFLELAESILE 

YFW^YLGmiMGASRRIERIYFEISETNRAQW 

EMPQVKESKRQFIFDVVNEGOEAEKMBLFVS 

FCEDTIFEMQiAAQISEPEGEPErDEDEGAGA 

AEAGAEGAEEGAAGLEGTAATAAAGAT/AV 

VAAAGRALRGLSYRSLRWIVIUU.RBLT^ 

ATAVAALLWAAVTRAGAAGAGAAAGALa 

LWGSIJGGGLVEGAKKVTVTELLAGMPDPT 

SDEVHGEQPAGPGGDADGEGASEGAGDAAE 

GAGDEEEAVHEAGPGGADGAVAVTDGGPFR 

PEGAGGLGDMGDTIPAEPPTT^SPILKRKW 

VDGVEEELPPEPEPEPEPELEPEKADAENGEK 

EEVPEPTPEPPKKQAPPSPPPKKEEAGGEFW^ 

ELEVORVKFLNYLSRNFYTLRFLALn^AIN 

FILLFYKVSDSPPGEDDMEGSAAGDVSGAGS 

GOSSGWGLGAGEEAEGDEDENMVYYFLEES 

SpALRCLSUJnLV^ 

PQLLTWLMSIDVKYQIWKFOVIFTDNSFLYLO 
SvMVMSLLGHYNNFFFAAHIXDIAMGVKTL 
KsVTHNOKQLVKTTVGU^VVVYLYTVV 

FHMYVGVRAGOOIGDEIEDPAODEYH.YKVV 
ronWFFVIVILLAnQGLnDAFGELRDQQEQV 
KEDMmCHCGIOSDYlTyrTPHGFEraTLEE 

^SWmfflmylinkdetehtgqesyvwk 

^ilYnFPrWPi^AfinCFRKOYEDOLS 


501 


1851 


A 


3869 


467 


665 


iJicFnCGKINSBWRELSVR 


502 


1832 


A 


3888 


1042 


724 


yRHAP«^LTNF\»FLVEMGFCyVGQAGRm- 
ASSlXJSALASQSAGrrOIolArOrrfrri-i'iriin 

OOrSVAOAGVO 


503 


1853 


A 


' 3891 


1773 


1193 


■ EVDSQSGVQ*QAPGSLQLyiPULKA'SCLL^ 

VOGGLKLUPSSNPFASAP'TAOrrGMSHCACT 
imff»MFRKISCIRE*F*HTRIYDIPFL]LFFKCT 
WVLLCYPGWPQIPGLKPSSCIRLLSSWDHRC 
APPCPASFFIFHVDRVSPPCPGLVSrrFKMLLL 


504 


1854 


B 


3896 


279 


70 


TQKHTIYHPYQVIFWSTOKDAMRSFMMPFY 


505 


1855 


A 




2 


1396 


NENTKLELRKVPPELhn^SKLNEH*'J)W'JJi^^ 

SS^TTOnmKDEVKAASPGRCLP^^^ 
^reTOMQKELLOTELDLYKKMQAGEEVTEL 
SSmOLEAAKRGILSSGRGRGIHSRGRGA 
VHGRGRGRGB"»»VP""AVVDHRPRALE1S | 
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SEQ ID 1 St 
NO: of N< 
nud- pe 
cotide se 
seq- u( 
uence 


3: of hod 
iptide 

Pl- 
aice 


1 SEQ TP^ 
ID NO: b€ 
in m 
USSN lo 
09/496 a 
914 • n 
aj 
n 

P 

s 


edicted TR 
^ginning w 
icleotide lo 
cation a 
>nespondi tc 
gto first a 
inino acid o 
:siduc of s 
eptide 
equence 


^ctedcnd 1 A 
icleotide ^ 
cation F 
sncsponding 1= 
) last amino N 
cid residue C 
f peptide 1 
equcnce ^ 

f 


inino acid sequence ( A==Aianme C^'ystem^ 

* _i' _ At^A T^sTrliittnTliR Acid. 

p^Aspartic Acid, li=u luiamiu /vwm, ^ 
=Phcnylalanine. G=Olycine, H=Histodine, 
=Isoleucine,K=Lysine.L=Leucine. 
4=Methionine, N^Asparagine, P^Proline, 
Hjlutamine, R=Arginine, S=Serine, 
'^Threonine, V-Valine, W=Tiyptophan, 
^-Tyrosine, X=Unknown. *«Stop codon. 
=possible nudeotide deletion. V=90ssiblc 
mcleotide insertion ^ 










i 

] 


/ITFKTRAEAEAAAVHGARFKGQDUOAWN 

-.M 4 X Ti »»i^T?\rPPr\'CT?T?OPP.Tll A 

KPVTNnSAViniEEVEPUbtAVKciii'^ 


506 1 


856 A 


3911 3 


1952 i 


>19 ^ 


OAEi^QUSLVLTQCCKRIKlJIVyhJ^^^^ 
DSQRLLNEVMVEHFFRQGMLDVAKIXQES 

GS?^QGffiNSPYVHlXD^QWADOT 

TrnMn-inRKl vrPvrPMEOSPGPAKQIFF 
'SHPFSPAPGHJDAFPPU'KPSKOUjHfO 1 AOA 


507 


1857 A 


3936 


439 


18 


-^SbFPGPQEMVaEmVlvWQLl^ 


508 


1858 A 


3944 


120 


412 


IQLYAAIETPHEm,FME\YECPK*W*GLG^T 
TRHGASRGGVCAHSIEGGELFERIVDEDYHLT 

FV 

LTKTPSPRliKliRGVLSVlXMMl*KUlVU' VKIP 


509 


1859 A 


3949 
3954 


31 
1013 


392 
885 


rrFSDSKS*YKATKIKTMWYCHKNRYID/ERN 
TsIfRAOCPRLEHSGRUiAHCSLNU'Cji.SOPPT 


510 
511 


1860 A 

1861 A 


3956 


■ 


1054 


SASSVAATTG f il/Vtev" 

■ PPAWAPRSPU\VAPlTJ01uua.KnAU;WSn^^ 
RWQPSEKQPPPPAHRGPADSLSTMGAAELS 
AEGAOKSRGSGEQDWVNIO'KTVEDTLL^ 
OHGHSGPFESKFKKEPALTAVARTAR^Kre 
PEP^EVGPPK\TreRPSRGCPHPQRGSRS^L 

L^ffli^CLRHHPLPHLIPTGPHRlJa^^ 
S^ILVADNAGGSHASKDANQVIKTT^ 

SNSPPSPSSMNQRRUJPI^OGQQA®"^ 
EPVHPASLPDSSIATSAPLCCTLCHERljmra 
FV^^SHKFCFPCSRQSKQQOASCaEVTC 
PSGEKCPLVGSNVPWAFMQGEIATILAQDVK 

- ^TT^r^/^eATUAU/^ PI P(a*DSPASAttK 


512 


1862 i 


K 3957 


1086 


3 


QDRARLDUSSA 1 t>AHCHUUJiuJJJ^i 

VAGTTOTHHHimiLGSSVQTGFDHVGQAG 

LELLTSGIOTBASESAG1MGMSHCVWP»SWG 

tISSwDGGRARGWSPEQSlW: 

H'KvPS'lMTQI^OIfflQYmMKRQt 

iRSca,PLPOBGEPGLQPSS\*SQNPCSSPL 

K^AmiSELLLQGQARRH*RSP^ 

CPAT15LTAWSQTKRl.RaQrW;ijr^^ 
PPVCHWSRRSLGDFIiPRSQG'FDGTASm: 

sSraSHLVAQAGVQWWJLOa^PPOT^ 
lOTFSRLSPPSSYTOKYVPSHLAESdSSRDRIP 


513 


1863 


A 3961 


3038 


476 


VGDmLSIDGTSMEYOI^TQ^ANTTOQ 
VKLElPIfflQTRI^CSPDHVraQR^^J 
WSWAShMSSLHirafflYKTYHPDHCRWM- 

MOTLPRSLYSTSPROTT^tt^'^'tMjg^^^ 
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SEQID 2 
NO: of I 
nucl- T 
eotide £ 
scq- y 
uence 


;eq id 1 K 

40: of h 
>eptide 
eq- 
lence 


let 1 S 
od I 
i 
T 
( 
< 


£Q I 
DNO; \ 
n I 
JSSN 
)9/496 < 

m 


>redicted 1 
)egiiining i 
lucleotide 
ocation 
xnrcspondi 
ttg to first 
amino acid 
residue of 
peptide 
sequence 


E>redicted end i 
lucleotide 
ocation 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


SmTno acid sequence (A==Alaninc OCysteine, 
C)=Aspartic Acid, B=Glulamic Add, 
F=^henylalanine, GpGlycinci H«Histidine, 
l«lsolcucine, K^Lysine, L»Leucine, 
M«Mcthioninc, N«Asparaginc, P*-Pioline, 
Q=Glutainine, R=Arginine, S=Scrine, 
T=Threonine, V^Valine, W=Tryptophan, 
Y=Tyrosinc, X«Unknown, *«=Stop codon, 
/=possible nucleotide deletion, V^ssible 
nucleotide insertion 














SLASSTVOLAGQWHTfcTrKV VLTADFV'iW 

GIQLQGSVFATETLSSPPLISYIEADSPAERCG 

VLQIGDRVMAINGIPTEDSTFEEASQIXRDSSI 

TSKVTLEIEFDVAESVIPSSOTniVKLPKKHN 

VELGinSSPSSRKPGDPLVISDIKKGSVAHRT 

GTLELGDKLLAIDNIRLDNCSMEDAVQILQQC 

EDLVKLKIRKDEDNSDEQESSOAIIYTVELKR 

YGGPLG\ITISGTEEP\FDL*nSSLTKGGLAERT 

GAIHIGDRILVAINSSSLKGKPLSEAIHLLQMAG 

ETVTLKIKKQTDAi^b/WipJrJSJSJ:' riooiu-o wi-v»i/ 

VEEDSSPAQKPGKLSDMYPSHGCPSVDSAVD 

SWDGSA\IDTS\YGTEOT\SFQASGYWNTYD 

WRSPKQKGSVLSPVmPRSQTYPDVGLSYED 

WDRSTASGFAGAA\DSAETEQEENFWSQALE 

DLETCGQSGlLRELEATlMSGSTMSLNHEAFr 

PRSP AGSDRPSFQERSSaKrn T oi^ 1 1 Roi^ i i*ro 
DVGRKSVTLRKMKQEKEIMSPTTVELHKVT 
LYKDSDMEDFGFSVADGLLEKOVYVKNIRPA 
GPGDLGGLKPYDRLLQVNHVRTRDFDCCLV 

VPLIAESGNIu-ULVloKl^tiA^x^-3*^ 

n*SEONSAFF00PSHGGNLETREPTNTL 


514 


1864 


A 


3967 


833 


800 


LEKQGVSGMATKRLARQLGLlKKKSiAPANG 

NLGRSKSKQLFDYLIVIDFESTCWNDGKHHH 

SQEHEFPAVLLNTSTGQIDSEFQAYVQPQEHPI 

LSEFCMELTGKQAQVDEGVPLKICLSQFCK 

WlHKIQQQKNIIFATGISEPS/DF*SKIMaCYL 

VR*RISYTy*SKHKSKGC 


515 


1865 


A 


3969 


492 


182 


CRFV/GISTHCDTCDPLSPQ'n'EG**EGDLWSL 
DLLGPEFLARKPLFKTKTYQSTF*SISKNE/FTC 
PNFnEEGTDLlF\*QVKHNPCHRLTPEEGTVQL 

NRADS 


516 


1866 


■A 


3977 


2 


1357 


KJvlLaOkESNYlRLKRAKJ^KSMFVJKiKllAji 

GAFGEVCLARKVDTKALYATKTLFKKDVLL 

RNQVAHVKAERDILAEADNEWWRLYYSFQ 

.^Tj'MTT vrt/XifrkVTDnrjnMKyr^T T TRMGIFPESL 
nKDNLYFVMD Y JjrUUJJiVliViDJ-»-L«">^v-t>-'" * a-*^*-" 

ARFYIAELTCAVESVHKMOFIHRDIKPDNILID 

RDGHIKLTDFGLCTGFRWTHDSKYYQSGDHP 

RQDSMDFSNEWGDPSSCRCGDRLKPLERRAA 

n/MinurT AUQi vnTPNVIAPEVLLRTGYTQL 

ciwWSVGVILFEMLVGQPPFLAQTPLETQM 

KVINWQTSLHIPPQAKLSPEASDLnKLCRGPE 

DRLGKNGADEIKAHPIF*NQFDFSQ*PEDSRS 

AFKOFP*NHTTPTDTSNFDP\VDPDKLWSDDN 

EEENVNDTLNGWYKNGKHPEHAFYEFTFRRF 

FDDNGYPYNYPKPIEYEYmSQGSEQQSDEDD 

ONTGSEIKNRDLVYV 


517 


1867 


A 


3980 


1358 


1022 


" TFFKKFTQSLGFLIJFSFSFLFSCWFHh VLFCY 
VFLDRVPLCHPGWSAWQSQVTATNLPPSWD 
♦RCRPPH/LANLCNFCRDVbr 1 iLfKi^vuN i w«. 
OAIFOPOPPKVLGLQV 


518 


1868 


A 


3986 


974 


666 


- -gpENffiSffflTQAGVQWHHLSSLQPLPl^FJ^'^ 
SCFSLPE*LGYRHVPPCLANSVFSVEMG\FLH 
VGQAGLELLTSGDLPALASQSAGITGVSHRAR 

PENGFENIF — 


519 


■ 1869 


A 


3994 


751 


- 126 


" "N0GLRHVGLCRTCLVNQMFASSILGK.SHHJti2» 
USINQGHNALWKAAG\PLPLKAGYaQSFSPC 
DSLKYGXSWDEKDLTVPQRDTHKRSVLRWIS 
ORGKVUVEMEEGHCLL\LPLGTECXGIKVPIV 
HLFSSEMGEVNRPMVGXARHVTSNAALLSFTP 
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nucleotide 
location 
oorrespondi 
ng to first 
amino ecid 
residue of 
peptide 
sequence 



1870 



521 



1871 



522 



524 



525 



1872 



1873 



1874 



4018 341 



1875 



4021 



527 



528 



1876 



1877 



4026 



1878 



4028 



1067 



593 



1160 



1879 



4057 358 



"531 1881 



4061 



532 



1882 



4069 



533 



1883 



50 



tedicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 

leptide 
sequence 



377 



19 



351 



242 



Jmmo acid sequence (A'Alanine OCystemS 
I>=»Aspartic Acid, E«=Glutamic Acid, 
F=Phenylalanine, G=Glycinc, H=Histidin^ 
I=lsoleucine. K=Lysme, L=Leucine, 
M=Methiomnc,N=Asparagine, P^Prolme, 
Q=Glutaminc, R=Arginine, S=Seiine, 
T=Threonine, V=Valine, W=Tiyptophan, 
Y^Tyrosmc, X=Unknown, *^top codon, 
/=^ossible nucleotide deletion, V=possible 

- SoSSsG LHARPVlVPSLbLby^^^^ 
AH WFADLLLIITLPSYYIPFC 



■OSFRl^LLSSWUYRHM*Pm.ANFM\hbCKpR/ 

T PPT v^^wpnAn.PPRPPKVLGLQT 
TTF*FrVSCSAS*AGVRSHUNSSLgHFSl'Cj\SSN 
PPTg ASWAGATGTHHHAWLLSV 
IT 1 i'i>Tiiinc<2\/vu\/TOGYKLR" 



PPTgASWAG ATGTHHHAWLL SV 

EYGPVYSTWSALEGELAEPLEGVSACIGNCST 
AL^ELTDPMTEDFLFVLREYILYSDSMK 



AT ♦FrTnPM TEDFLFVLRfc^i ^^^^^^^"^ 

SSraEAIK^^KEKQyFL8KMVYPANLSP 
GNEGDTTSPPAK 



i^f^rrvi AiJMVLlS*PHDLPASASQSAGrrOL 



RQ(WAFMVLlS*PHDLPASASQSAGrroL 

^HP^WPTSSILS —————— 

P^KQ/F/SHLSPPSIWDYRRVPPCLW^^^ 
™<^OPCL0LLGSShffPASASQSAGIAGlSH 




NKDNRPTHPVESSF __ 

■ TPSSTSRQTKECWSSKMA WQRKtbKBHU WR 
RS^EDGWKADKP/VDO*TPGEDHLPTPSPFQ 
T Hms!S!V. sni.HHSVKSPPSLSFRL M 



"DFYLYPERKKRGQMMlAV&Li "^"'Cf" 
DVAVYFTTKEWAIMGNPAERALYRDVVO^ 
yGOCGPL*cmSKPALVFaLEQOKESCFSPA 
rriQgi S BNnWRAOWlGYLELRR YTYLS . 
GTSEL)XIQKWNWQPAFPPRP0LA1^11|QIA. 
SsAKSWVTPAWWHNKffl^ADPM 
oc*«n oTPTOVFSSPOPOLPAGEQUEOLKHAQ 



NDLLENFKtWb*l-KE»UiNMilV 
QIWMPVSLMNIVTLKCPT 
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SEQID : 
NO: of 1 
nucl- ] 
eotide 
seq- 
uence 


SEQID 1 
1^0: of 1 
peptide 
seq- 
uence 


Viet 1 5 
^od 1 
i 

1 

1 


5EQ I 
DNO: 1 
n 1 
JSSN 
39/496 
?14 


Predicted J 

seginning 

aucieotide 

ocation 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end * 
tiuclcotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


i\mino acid sequence (A^Alanme OCysteine, 
>Aspartic Acid, E=<jlutamic Acid, 
?=Ptienylalanine, G=Glycine, I^Histidme, 
Hsoleucine, K^'Lysine, L=Leuc!ne, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tiyptophan, 
Y=Xyrosine, X^Unknown, ♦«Stop codon, 
/^ssible nucleotide deletion, \=possible 
nucleotide insertion 
IFNAIPIKMPMMCMAKIEKNSS 


534 


1884 


A 


4088 


3 


1931 


IIDSSTRRMESERSPLYRQLIDLGYLSSSHWNC 

GAPQQDTKAQSMLVEQSEKLRHLSTFSHQVL 

QTRLVDAAKALNLVHCHCLDIFINQAFDMQR 

DLQrrPKRLEYTRKKEl^LyESU^NIAimK 

EMKDMIVEUimiKEELLDDATNMEI^^ 

VPENGEPVGTREIKCCIRQIQELIISRLNQAVA 

NKLISSVDYLRESFVGTLERCLQSLEKSQDVS 

VHTTSNYLKQILNAAYHVEVTFHSGSSVTRM 

LWEQIKQIIQRTTWVSPPAIUEWKRKVAQEAI 

eslsasklaksicsqfr™lnssheafaaslrq 

LEAGHSGRLEKTEDLWLRVRKDHAPRLARLS 

LESRSLQDVLLHRKPKLGQELGRGQYGWYL 

CDNWGGHFPCALKSWPPDEKHWNDLALEF 

HYMRSLPKHERLVDLHGSVIDYNYGOOSSIA 

VLLBVERLHRDLYTGLKAGLTLETRLQIALDV 

VEGIRFLHSQGLVHRDKLKNVLLDKQNRAKI 

TOLGFCKPEAMMSGSIVGTPIHMAPELFTGK. 

YDNSVDVYAFQILFWYICSGSVKLPEAFERCA 

SKDHLWNNVRRGARPERLPVFDEECWQLME 

ACWDGDPLKRPLLGIVQPMLQGIMNRLCKSV 

NSEOPNRGLDDST 


535 


1 SR^ 


A 


4090 


2 


417 

• 


■ ALMPHEANYEEIFLKTDKDMDGFESGLKVKii 

utlktr/glpstluihiwaix^dskdcgklskj:) 

HFALAFHLmQKLKGIDPPLVL'ITEKISPSNR 
ASLQKVTELTRKPVCnFKGTILWRITDSIWMK 

HNRKRIWLRA 


536 


1886 


A 


4102 


569 


829 


~bHQK*KNIPCSWIGRINlVKMSILPKAIYK^S>Ai 
PIKn>MTFFTEI*S*NVYRTrKTQE*AKAILSKK 
EONLEESHYLDFK*YYRAV ^ 


537 


1887 


A 


4104 


54 


281 


SmCEHURRMLVLDPSKRLTlAQIKEHKWML 
lEVPVQRPVLYPQEQENEPSIGEFNEQVLRLM 
HSLGIDQQKTIE 


538 


1888 


A 


4109 


141 


314 


IRftff^lKIRSWSHIJKCFyK^'U^^^ 
VPRENTTAWMNAIGLirrALPVS 


539 


1889 


A 


4111 


268 


1 


■ ASRPWGHSYP*FNQQEVDTLKRPIASSEFM^a 
I^KFATXKKSPGPYRFTAEFSHTFKEDLVPILW 
PLFPKIYREGTLPHSFYEASITL 


540 
541 


1890 
1891 


A 
A 


4142 
4146 


198 
282 


2064 
778 


■ -p^GAGRAATPWGPLKWRGRGSGRCEKJVAt, 
AALGDFLGLHRRTQQPAVDRLLSDASAQWR 
VRGHGGVRESGRAPQQPGRRRGRRPRKRPR 
GRWRREGCGAGGRGVCVAAWSQRSIAGNN 
DYRLFHKMSNSHPLRPFTAVGEIDHVHILSEH 
IGALLIGEEYGDVTFWEKKRFPAHRVILAAR 
CQYFRALLYGGMRESQPEAEBPLQDTTAEAFT 
NDXKYIYTGRATLTDEKEEVLLDFLSLAHKY 
GFPELEDSrrSEYLCTILNIQNVCMTFDVASLY 
SLPKLTCMCCMFMDRNAQEVLSSEGFLSLSK . 
TALLNEVliGDSFAAPEKDIFLALLNWCKHNSK 
ENHAEIMQAVRLPLMSLmLNWRPSGLLSP 
DAILDAIKVRSESRDMDLNYRGMLIPEENIAT 
MKYGAQWKGELKSALLDGDTQNYDLDHG 
FSRHPIDDDCRSGIEIKLGQPSIINHVRILLWDR 
DSRSYSYFIEVSMDELDWVRVIDHSQYLCRS 
WQKLYFPARVCRYIRlVGTHmVNKIFHIVAF 
ECWTNKTFTLEKGLIVPMENVATIADCASVI 
EGVSRSRNAIXNGDTKNYDWDSGYTCHQLG 
SGAIWOLAOPYMIGSIRVLLWDCDDRSY 

- ■ OTLGYFNOAROQPQDNFFAHOWSHHPPISAU 
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SEQID b 
NO: of 1 
nucl- P 
eotide s 
seq- * 
uence 


>EQID 1 
40: of h 
)eptid6 
•eq- 
lence 


Act S 
lod I 

i] 

I 
C 
c 


EQ F 
DNO: t 

Q H 

JSSN 1 
►9/496 < 
>H 1 
1 
1 


redicted 1 
teginning i 
iucleotide I 
ocation ( 
:orrespondi 1 
ig to first J 
imino acid < 
residue of 
peptide 
sequence 


Predicted end ~i 
iucleotide ^ 
ocation | 
joiresponding 
0 last amino 
Lcid residue 

peptide 
sequence 


sonino acid sequence (A-Alanme 0<:ystem^ 
>Aspartic Acid, Ep<5lutamic Acid, 
'^Phenylalanine. G=Glycine. H=HistidinQ, 
=Iso!eucme, K=Lysine, L=Lcucme, 
VI=Methionin^N=Asparagine, P=Proline, 
[)=GIutamine, R»Arginine, S=Serine, 
threonine, V=Valine, W-Tiyptophan, 
Y=Tyrosine» X^Unknown, ♦=Stop oodon, 
^=possible nucleotiac aeieiton, ^Fposbiore 
nucleotide insertion 














jlAESeJFAFWQDMKWKNKl' SVGKSLBlVi'Vij 
T\^SLPia?GDHFEWl^VTSCIHNVLSGQRW 
lEHYGEVLIRNTQDSSCHCKITFCKAKYWSSN 
VHEVQGAVLSRSGRVLHRLFGKWHEGLYRG 

^ _ i.i.ini T t-J-i A tT i'UUWA 


542 


1892 


A 


4147 


44 


433 


SVDAYVCNDIVFSYRl n'iLm<jA*LlHRYVA 
ODPKQGQLRSLHLTCDSAPAGSQGTWSTSCR 
WlFRGGAQrm.ATn)DSPKAVLGDR]L^^ 
ANVSSENKIPRmTTPQLELSVKDAVYTW 

SSH 


543 


1893 


A 1 






11 


YVAICSALQ*CSirrP/ELCQGLPVLA*AGSSLIS 

PVHTVIMSWJtfCSSAQISHFYRDAYIXMKIA 

CSHT'VNQHVFLGAWLFLAPCALILVSYDUA 

AAIUUPSPTWWKACSICSSHWLVTUnfCTn/ 

LGia'PPDSFSAQDAIATIMYTVVTSMLNPFIY 

sr MNKEVOEAVRRLFSRGSHSSWCW 


544 ■ 


1894 


A 


4158 


3 


538 


LLYAQAOVQ»LNLSSLgPgPAGIJtQSSltirsu' 
SSWDYRYSTPHPANFFVEMEFHHVAQAGl^ 

LGSGDlJTSTSHSAOrrGV^m^PPRI^ 
LLGHLLCLPMVFPLLCVFVLISSSLAGEEAAG 
LRVQKLWPAWLSHLPVCWFHCSGIWSEVIE 
I KVOREGHVLPWQAHWBF 


545 


1895 


A 


4160 


1 


412 


HPLGLGLVPSEIFSPQDKKAAUGSlLAPAROb 
DLEAOLKGSFMDGRLQASVSVFR]QRVGS>^ 
QDTASAMPCLPYYPTSHCFMAGGKSRS^W 
ELELSGEPAPGVVQVLAOVrYTQARYIJRDASE 

ANVGOPLRPVDPR 


546 


1896 


A 


4174 


1252 


1190 


TFQVFIFLFLIFFKTEmsa;PUAVyWHUUJM- 
OPPPPRFKGFSCLSLPSSWDYRHAPAHPANFV 
FLVErGFLHV\GC!\ASLELPTSODTPA»ASQSA 
GlTr,VSHHA*PRASGRRCW 


547 


1897 


A 


4176 


3029 


1 


AGPDGLAAPASCgUAKUg IRVPGAFSWLAT 

GSHHASEGLAPGVPPAGGVSAQELTAPPQEG 

WGLGAPPAAPRPESDEKBAGSDAVRSFSRGA 

RDSLGQRRLGOTRQAOPAGKGAQRTMGPAS 

GFHSFPPRPHQEPSPRSSCWQHULWHCPWPQ 

PSMi^TPAQLLQGPGVLAAPPGP*HVPGFL 

!S)a^WPLPSGPRSP*DPLHQOALVPLPQGGSP 

HTAPHCLPSVLSPAIQQPLLPTAST/S^PAS 

TMAPIPSALAVWEPAGSSPQLSSAPADSmP 

ALPKVLPPWTQKPLWCLCQSPlJtl5RPDQV 

RCPPACSPAAASSFSFESQPCPSAPSKAOT^ 

ALUVGPHHPP'SQQPQSQSVHPHOTGGPOm, 

AASSLFWA4FCQPPPPHPQFLWHRPIi»yTOKA 

]jVS«>LCHU'APGSLRQTPLm3FHIPRP0LS^/ 

PPPASGTSDSSDSRSPSASAARVWPPAXWPK' 

AARHRPHPFEmSPCPFSCMFraumraR^ 

ALQTPRAWDLPPGSSPAPLCSOPELP*APPPLP 

PFPRVA»LGSGHPPSAQVPOLWRCVWff 

RPVGHS*SGPPHSPPL*APPQAmELPPSRQC 

LQPUaRAAQPU)PCCSLSPP<H«JVPAlPS 

WGRP*SPSPASSQPPYHAOLP(H'QSSPLHra. 

pSi-SLRSOSQQPLLFFQCPOPOAyW^G^ 

STsphpppp/artqtfpvasrsl^gt/^^ 

TPSRSASSLPEWLASSLPKIPQSSGSMl^OT^ 
MP*CFHRPSPPLP/LSSPFPA\l^APQFPIJnJ 
p.ppAPSPGa»U>PIAQQH<ffSPPSPHi«Sm.T 
PPLWPSLAIi-P*PLPPl*H'VPSFSASaCSLrAH 
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SEQlt) b 
KO:of 1^ 

eotide s 
scq- u 
ucnce 


EQID ^ 
0:of h 
eptidc 
5q- 
ence 


let S 
od II 
ir 
I 
0 
9 


EQ P 
3 NO: b 
I n 
ISSN U 
9/496 0 
14 n 
a 
r 
f 


redicted ^ 
pginning n 
ucleotide U 
)cation c 
OTtcspondi t 
glofirst a 
mino acid c 
esidue of ^ 
)cptide 
equence 


redicted ena ^ 
uclcotide ^ 
)cation ^ 
orresponding I' 
D last amino ^ 
cid residue C 
)f peptide • 
sequence ^ 
/ 
1 


fluno acid sequence {A=Alanine i^^rsremc 
mA^c Arid, EKjlutamic Acid, 
-Phenylalanine, 0=Glycine, fr-HisUdme, 
=Isoleucine, K=Lysine, L«l-euciiie, 
/l^vlethionine, N=Aspaiagine, Feline, 
w}iutamine, R=Arginine, SFSerine, 
r='nireonine, V=Valine, W=Tiyplophaii, 
f=Tyrosine, X-Unitnown, *=Stop codon, 
-possible nucleotide deletion. V=possible 

inclfo^'de insertion 1 — 

SifASPGLGRSCLGKPgrLPWISt Wi'l'SCiKLA 












1 


SrSrsaXSSPGPtf'AFQPPRFGCL'CTP 
GPPGLm.QSSLSFPPPPPPVPQPPAPPALQWO 


548 


1898 




4180 


2369 


833 ~" 


HQFfSu^VLFWKLCDI^VGQPRGAW^^ 
S^^P0GW5SVRLRGVGQSCPSIJELSPW 
S»KnWKSSDVU)ILVPIimNDAR 

Son^LlWnASPYUCSLSMVTANKLL 
SJSSrmiFSAAQlWHLVFHXEVF^ 

iSHDRPmSSASGQWSPTPEWyW^iaPLQ - 

WWnvm pxTvnPWWVnTDVKLFEIQRy 
™^imLSRGKMAVlUWLbSLRMyM 


"549 


1899 


A 


4191 


85S 


321 


ii:SSSSlGQWQLEVGDPAPLG 
AcKaiKESNAJfflFMRKDTKMSFQWm 
^jSwKDVYSVSVDQKERCnrnTTNK^ 
f.pc,n.pT noHQl .PLDDALLSFA\TPTAP 


550 


1900 


A 


4192 


1 


1 1980 


SSrrsevi^sivclqkalnhu^ 

SLffiRI^KSISVCXIKELNaCSELHVEPFQEEG 

LQEQW3ELC\EILCMPHYDIDSASWS^aNQ 

TOOHVra^ETKASRREEFmSIKRQlILCM^ 

^^HTTOTSreWOVVCEDEDAFCLSLENIA-nL 

SSvlSksqneavcegvlrtqwielw 

>n)BlEELEKCKTMKKVIEAmVELVQYW^^ 
FYSQEQRQAFAPFCAEDYTESLLQLHDA^ 
SS™ra;ELFEGVQKWEErWRLFLEreR 

^SUftorggnllkeekqfah^^ 

J^^iEEUCARIEL^^^QEHSKAF^mIGQK^ 

?SwafflroLEKERAKQERQLK>0aCQ3ET 

m^SAPRTPSKRRGLAPNTPGKARKLNTT 

SS^ASaRPIFGOTVYHSPVSRLPPSGSK 

SS^STraOraOTRTGRHGANKENLELNGSI 

Isgg^pSaplqrnfsinsvastysefadpsls 

DSSTVGLQREL^ASKSDAT^^ 
- AvJMBGLVSSPAIGAYLSASyGDSLWLVAlV 


551 


1901 


A 


4194 


""3 


1008 


OSctSeslpbkmrpvswoaqisw 
lSvaotaflsilmrsujnkntvllglgfqm- 

SwOFGSQAViTvlMWAAGTVAAMSSmT' 
AI^VSKNAESDQQGVAQGnTGIROLCNGL 

SJiJoSlfgacivlmsflvalfipey^ 

WQKHSNSSSGSLTOTPERGSDEDIEPIiQDS 
SIWELSSFEEPGNQCTH. 1 
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I SEQID 
NO: of 
peptide 
seq- 
uence 



Met 
hod 



1902 



SEQ 
ID NO: 
in 

USSN 
09/496 
914 



4197 



Predicted 
beginning 
nucleotide 
location 
correspondi 
ng to first 
amino acid 
residue of 
peptide 
sequence 



Predicted end 
I nucleotide 

location 
I corresponding 
I to last amino 

acid residue 
I of peptide 
I sequence 



14302 



Amino acid sequence CA=Aianine OCystanc, 
D=Aspartic Acid, EF=Ghitamic Acid, 
F=Phcnylalanine, GKjlycine, H^Histidinc, 
I=Isoleucinc, K=Lysine, L=Leucine, 

j M=Me(hionine,N==Asparagine.P=Pioline, 
Q^Glutamine, R«Arginine, S=Scrine, 
T==Threonine, V=Valine, W^Tiyptophan, 
Y^Tyrosine, X«Unknown, ♦^Stop codon, 
/^possible nucleotide deletion, >Fpossiblc 

1 nucle otide insertion ^ 

■ arpppapgsrqqkqkaapoaaaaaelkgar 

EPAPARRRGTMADGGEGEDEIQFL'RTDDEW 
LQCTATIHKEQQKLCLAAEGFGNRLCFLESTS 
NSK3WPDLSICrFVa£QSLSVRALQEMLANr 

veksegqvdvekwkfmmktaqggghrtll 

yghaillrhsysgmylcclstsrsstdklafd 

vglqedttgeacwwtthpaskqrsegekvr 

VGDDLILVSVSSERYLHLSYGNGSLHVDAAF 
OOTLWSVAPISSGSEAAQGYLIGGDVLRLLH 
GHMDECLTVPSGEHGEEQRRTVHYEGGAVS 
VHARSLWRLETLRVAWSGSHIRWGQPFRLR 
HVTTGKYLSLMEDKNLLLMDKEKADVKSTA 
FTFRSSKEKLDVGVRKEVDGMGTSEIKYGDS 
VCYIQHVDTGLWLTYQSVDVKSVRMGSIQR 
KAIMHHEGHMDDGISLSRSQHEESRTARVIRS 
TWLFNRFmGLDALSKKAKASTVDLPDBSVSL 
SLQDLIGYFHPPDEHLEHEDKQNRLRALKNR 
*.«r^r-^x nxTi \n crmpl HVYSSAAHFAD 



1 CAOFSGSLDWLISRLERLEASSGILEVLHCVL 
VESPEALNIIKEGHIKSnSLLDKHGRNHKVLD 
VLCSLCVCHGVAVRSNQHUCDNaPGRDLL 
LOTRLVNHVSSMRPNIFLGVSEGSAQYKKWY 
YELMVDHTEPFVTAEATHLRVGWASTEGYSP 
YPGGGEEWGGNGVGDDLFSYGFDGLHLWSG 
CIARTVSSPNQHLLRTDDVISCCLDLSAPSISF 
RTNGOPVQGMFENFNIDGLFFPWSFSAGIKV 
RFLLGGRHGEFKFLPPPOYAPCYEAVLPKEKL 
KVEHSREYKQERTYTRDLLGPTVSLTQAAFT 
PIPVnrSQIVLPPHLERIREKLAENIHELWVMN 
KIELGWQYGPVRDDNKRQHPCLVEFSKLPEQ 
ERNYNLQMSLETLKTLUU.GCHVGISDEHAE 
DKVKKMKLPKNYQLTSGYKPAPMDLSFKLT 
PSQEAMVDKLAENAHNVWARDRIRQGWTY 
QIOODVKNRIWRLVPYTFLDDRTKKSNKDS 
LREAVRTLLGYGYNLEAPDQDHAARAEVCS 
GTGERFRJQFRAEKTYAVKAGRWYFEFETVTA 
GDMRVGWSRPGCQPDQELGSDERAFAFDGF 




NEHTMMr llsskjgu^^'^^^^^'^'^**' • — 

GFIPVCSLGVAQVGRMNFGKDVSTLKYFTIC 

GLOEGYEPFAVNTNRDITMWLSKRLPQFLQV 

PSNHEHffiVTRTOGTlDSSPCLKXn'QKSFGSQN 

SIOTIMFYRLSMPIECAEVFSKTVAGGLPGAG 

LFGPKNDLEDYDADSDFEVLMKTAHGHLVP 

DRVDKDKEATKPEFNNHKDYAQEKPSRLKQ 

RFLUIRTKPDYSTSHSARLTEDVLADDRDDY 

DFLMQTSTYYYSVRIFPGQEPANVWVGWn| 

DFHOYDTGFDLDRWTNTTVTLGDEKGKVHE 

SIKRSNCYMVCAGESMSPGQGRNNNGLEIGC 

VVDAASGLLTFIANGKELSTYYQVEPSTKLFP 

AVFAQATSPNVFQFELORIKNVMPLSAQLFKS 

EHKNPVPQCPPRLHVQFLSHVLWSRMPNQFL 

KVDVSRISERQGWLVQ(XDPLQFMSLHIPEmN[ 

RSVDILELTEQEELLKFHYHTLRLYSAVCALG 

NHRVAHALCSHVDEPQLLYAIENKYMPGaR 

AGYYDLLIDIHLSSYATARLMMNNEYIVPMT 

EETKSraFPDENKKHGLPGIGLSTSLRPRMQF 

SSPSFVSISNECYQYSPEFPLDILKSKllQ]Nffl-ra 

AVKEGSLHARDPVGGTTEFIJVPUKUnrrLU 
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"SEQ" 
IDKO: 
in 

USSN 
09/496 
914 



as 

beting 
nucleotide 
location 
correspondi 
ng to first 
amino acid 
residue of 
peptide 
sequence 



Predicted eno 
nucleotide 
location 
corresponding 
to last amino 
acid residue 



of peptide 
sequence 



■Amino acid sequence (A=Aianme C=Cysteine, 
ENAspartic Acid, E^lutamic Acid, 
F=Phenylalanine, G=Glycine. H=HisUdmc 
I=Isoleucine.K=Lysine,L-Leucme, 
M^Methionine, N=Aspara^ne, P=Prolme. 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, V=Valine, W=Tiyptophan, 
Y=Tyrosine, X»Unknown, ♦-Stop codon. 
/-possible nucleotide deletion, \=possible 
nucleotide insertion 




Ai TARKTKEFRSPPQEQINMLLMFKDDKSECy 




SoNQKAMFDHLSYLLENSSVOLASPAMRG 

?5^GcS^CQNaVSKGYPDI0WNPVE0ro 
YI^FLW'AWa^GESVEEtWNV^^ 

npSGSSKTIXTTEEBEDOTTHMGNAIMTFY 
LffLGDLVGVISlAFQMFnAKDGNVVEPDMS 
EVGFlPDLRAAASIXn-AAWATDMALALmY 




KS^WOFKDIXLDTPSIEKMA 





^SMOTSLIVAALKRLLPIGLNICAPGDQEL^ 
1 ?frFnrTTH VK/AVAT1TMAKSCHDEEDDPG 
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ASho add sequence (A=Aianme v^y.«-^ 
I>Aspartic Acid, E-Gl«tomic Acid, 
F-Phenylalanine, G-Glycine,lt=»Histidin^ 
Msoleucine, K=Lysme, 
M=Mcthionine, M-Asparaginc, P«Prolmc. 
0=Giutamine, R^AiEjnine. SF^enne, 
Tvnireonine, V^Valine, W=-Tiyptophan, 
Y=Tyrosine, X=Unknovm, ♦=Stop codon, 
/^possible nucleotide deletion, V=possiWe 
nucleotide insertion 



E^)wTISASKGETGra4VAATUCU3IAILNOG 

9iHi:, i:Jr.^_.^DXTPQv atovakovfntltey 




^ru^^<!HKHYTOSErEaLSCABroENE 




pmn)MEGGEKEKMElJ^CEDTIFEMQLAA 




TsSFFRnraUXGGSLVEGAKKIKVAm. 




KLGVVFTONSHJI^^ — 





I.WTDNfTLVLH 
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WO 01/57188 



'SEQIU 
NO: of 
nucl- 
eotide 
seq- 
uence 



"SEQID 
NO: of 
peptide 
seq- 
uence 



Met 
bod 



SEQ 
ID NO: 
in 

USSN 
09/496 
914 



Predicteci 

beginning 

nucleotide 

location 

conrespondi 

ng to fust 

amino acid 

residue of 

peptide 

sequence 



1906 



T57 



1907 



559 



1908 



"Predicted enu 
Ducleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 



4213 



422r 



774 



560 



561 



1909 



1910 



1911 



4235 



507 



IST 



4246 



"EST 



1300 



323 



654 



aSSo acid sequence (A^Alamne Otysleiue, " 
D-Aspartic Add, E=Glutamic Acid, 
F=Phenylalamne. G=Glycine, H=Histidine, 
l=Isoleucine. IC=Lysine, l^^e^cin^ 
M=Methionine, N=Asparagme, P=Prolme, 
(W}lutamine, R=Aiginine, S=Serine, 
T^Threonine, V=Valine, W==Tryptophan, 
Y«Tyrosine, X-Unknown, *'=Stop codon, 
/^possible nucleotide deletion, \ppossiblc 
nucleotide insertion 




^T»m-PPSTLSVKOQ "^VK.G™ 
ARRFSCLTLylSWOHRHVUPm^^Vl; 
^^VrnAfiliKl.PrSGDPPASASQSARn 



SoQAGWrSGDPPASASQSARTTGMS 
mrArn ftW mgnaJFlVYKIMYTL 

li'rJ^^i^^I^irvrvimnRKYSFOKGPLNRI 




KKVP 



HHAWLQVLNS ^ 




i^r^.DGSSPlAOSIHRGLLSRESWH>VNN 




JSffio^&SKTYETnmRLTVEEATA 
^,fcpn^.l noiS MKDSDEEDEE DP 

iSkSLQYVFnYKLVELEPKSKTYlL. 
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SEQID 
KO: of 
nucl- 
eotide 
seq- 
uence 



SEQID 
KO: of 
peptide 
seq- 
uence 



Met 
hod 



562 



1912 



563 



564 



SEQ 
ID MO: 
in 

USSN 
09/496 
914 



Predicted [Predicted oia 

beginning nucleotide 

nucleotide I location 

location corresponding 
correspondi to last amino 

ng to first acid residue 

amino acid of peptide 

residue of j sequence 
peptide 
sequence 



4260 



1498 



1913 



A [4265 



1914 



A I 4270 



566 



1916 



A [429^ 



167 1917 



568 



1918 



4299 



623 1 116 



83 



368 



406 



1041 



4300 2012 
A U302 I W 



229 



1106 



Amino acid sequence (A-Alamne C^yslem^ 

I>Aspartic Acid. ^J"^*^-^*?^^:^ 
F=Phenylalanine, CH31ycine, H=Hisbdine, 
Wsoleucine, K^Lysine, L^Uucinc, 
M=Mctiiionme, ■N=Asparaginc, P^Prolmc, 
Q=Glutamine, R=Arginine, S^Serine. 
T=Threonine, V=Valine, W«Ttyptophan, 
Y=Tyrosine, X^OJnknown, *=Stop codon, 
/^possible nucleotide deletion, \Fpossible 

nucleotide insertion - _ , 

INUEPVEEDAjEMRGDQOTKl lOLLMl^j^jG^ 



1843 



531 



FMKG^nrIKEreAWDFLLAL\GVYPTiC^ 
PAOYCEALADFEMRARPQPSGPAPSS 



^^^^^^ 

PUV^GQREAARIHNHLPEIQKFSSRIREA^ 

ToSmSFFIALm4ANLPVPStXFCK3Lyn^ 
SmSDPimH^VTATMWAVLEl^G 

LR^RMraaQLEUUVRGPLRQTFTHNPLLQ 



TOGT^TLSMPSNNLPLPLSSSATM^^ 
m.SPNOGlPRQSPVKmTlPMSFQPVPGGW. 
reloiWmOT^ILTAPPAIiPHPPIHFrQQSF 

DRMEWOKSVF 



n"KRKLSSLNSEVSTlQNrKMLAFKAlAy^ru> 
S^WLlSraPAAQVMAYLFTIINSLQOF 

GOTOASRTPRT 



GS^VVmNRLGMMIDmASDUIBK 
^SQAPVIFSHSAARAVCDl^NVPDDILQ 
l5^K?rafflVMmSM0VLQCNUJ^STVA 
n^JroH^vroSEFIGIGGNYDGTGKPPQGLVE 
S^SS^SsRSWSEEELQGVLRGNJXR 

wKkvreesraqspveaefpygqlstsch 

FHLGASEWTPRUJWR =rs^rc5n 

GATPLOSVOORiGKMDAATLi i<UiLKt-/mii 
DWKTSH'VWILGFKYSinEKDEILSDVASRL 
;?SS3GrGPrS0rGWGCMLRCGQ 
KQiaVCRHU)RDWRWTQRKRQPDS^^ 

Sfidrkdsyysihqiaqmgvgegksigq 

WYOTOTVAQVLKKMVF0WSSLAVHIAJ^ 

SJ^SSrlcrtsvpcaoatafpadsdrh 
SSpagaevtorpspwrplvixipuu^ltd 

KvmKHCFMMPQSLGVlGGKPN&^Hy 
SQYVGEELiyLDPHTTQPAVEPTD^ffDES 
FHCQHPPCRMSIAELDPSIAVVRGGHIiSTQAF 

n AECCLOMIRKT PGFT.RFFFSMLG 
SRKFLTnHVLYFLTSFYlKVUQiHHUjl^^ 
I x/iQVT.TPKLPOLHGVRlFGINKY 



7-^ in AuTWPRSARWLLWU^^Ji^aAHRg 



211 



PCTAJSOl/03800 



WO 01/57188 




Amino acid sequence (A^AIanine 0=Cysteme, 
D=>Aspaitic Acid, E^lutamic Acid, 
F^Phenylalaninc, G=01ycine, H=Histidiiie, 
I=Isoleucine. K=Lyslne, L=Leucine, 
M=Methionine, N«Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Scrine, 
T=Threonine, V^Valine, W=Tiyptophan, 
Y«Tyrosine, X=Unknown, *«Stop codon, 
/'^possible nucleotide deletion, >?T)OSSiblc 
nucleotide insertion 



nucleotide inserti on 

IXHCARJLNGKPVCEDSt-SgHVRVNVCVSl^lHJ 
CVWWGVGCVKCLPPRAHfflWQEKPLGPHRT 
VTFRTa.KAEGKTKEKAREKERKKKS 




GKRNKLRVYYLSWLKNJ<.il^iJrxivx:.iviv.v£vj 
WTTVGDMEGCGHYRWKYERIKFLVIALKSS 
VEVY AWAPKPYHKFMAFKSF ADLPHRPLL V 
TM oHnjrnnnT rvTYGSSAGFHAVDVDSGNSY 



DIYIPVHIQSQnTHAnFLPNTLKjMbJVLLi-'- 1 c 
DEGVYVNTYGRIKDWLQWGEMPTSVAYIC 
SNOIMGWGEKAIEIRSVETGHLDGVFMHKRA 
QFLKFLCER>JDKVFFASVRSGGSSQVYFMTL 

NRNCIMNW 



DFPETSEPVWILGRKYSIFTBKDBlLSUYAaiu. 
WFTYRKNFPAIGGTGPTSDTGWGCmRCGQ 
M1FAQALVCRHU3RDWRWTQRKRQPDSWS 
VLNAFIDRKDSYYSIHQIAQMGVOEGKSIGQ 
WYGPmVAQVLKKLAVFDTWSSLAVHIAMD 
NTVVMEEIRRLCRTSVPCAGATA3FPADSDRH 
CNGFPAGAEVTNRPSPWRPLVLUPLiaGlAT 

dWyvnetlvkhofhgwpqfpg/vvhregk 

PNSAHYFlGYVGEELryLDPHTTQPAVEPTDG 
.M.mrM7CPUPnHPPCRMSIAELDPSIAVVRGGH 




KEYSHILVOLEDGKLl VYOVUfcrAti V nxowi^ 
FISHAVGDYFGSPSFQLIEKSPLGINKLKAKFD 

FSKGSK 



FSK.GSK. . 
MDTLEEV1•WANGSTALPPP1J^1'N1S VPHKCLL 

LLYEDIGmVRYWDLLLUPNVLFLIFLLWK 

LPSARAKIRHBSPIFnT^VmALVGIAIU 

WSMTVSTSNAATVADKILWEnWFLLAIEL 

Sl^GHLESKSSIKRVLAinVLSLAYSV 

ir^r^TT mr vmAHLSAEDFNIYGHGGRQFWL 



VSSCFFFLVYSLVVILPKTPLIfUiKiSbi'aiu^or i 
WAGII^LNLLQGLGSVaCFDIIEGLCX:VD 
^TTTTT VPS FPAPLIYVAFLRGFFGSEPKILF 
"^^.rrxxmvinxin Agnitnr T.nCRIOLAEKFVKA\ 



ATTFLYFS FFAPLIYVAl'LKUrruocrrsaL^ 
GCWWRHPWLASQRDCLDCRIQLAEKFVKAV 
SKPSRPDN!NPIRVKEVYRLEEMEKIFVRLEM 
_.„_,^««^^^T ovrnDnnDUPVPMni.YIVRT 




VhmPWTMGFSKSFKIUS>f i-YNK^i*^^ i rui.r 
ADSIAPFHICYYGRLFWEWGDGIRVHDSQKP 
nnpn KLSKEDVLSFIQMHRA 
rT.^D^v-DT:wP'i'AWRISPV\ 



6VEGRQGREVKRTAWR1SPVWRPARCRKRS1 
POP/PE/PGAQQQERHRQGEAPMQALDPRAEP 
GPQAQSHAACQPEPBPPRVLLDFTAAR^^ 
GRP/GLSRHPGLAPHPQTOTPWPQSGRLPCAS 
PPT PirTGlRPTPGLEPKGRDLM 
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NO: of 
nucl- 
eotide 
seq- 
uence 



577 



578 



SEQID 
NO: of 
peptide 
seq- 
uence 



Met 
hod 



1927 



579 



S80 



1928 
1^ 



1930 



581 



SEQ 
ID NO: 
in 

USSN 
09/496 
914 



4366 



Pi 
b( 

nucleotide 
location 
correspondi 
ng to first 
amino add 
residue of 
peptide 
sequence 
785 



Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 



4367 



4383 



4397 



410 



4414 



583 



585 



586 



1933 . A 

1934 |X 



4424 
443r 



4439 



1935 



1936 



587 



588 



589 



1^7 [A 
1938 \T 



1939 



670 



502 



221 



224 



94 



194 



44r 



4463 
446r 



4471 
4480^ 



4487 



10 

noT 



166 



628 



144 



1^ 



922 



387 
l45r 



332 



wSno acid sequence (A^Aianinc i 

I D-Aspartic Acid, E^J^*^^^^ . ^ 
, F=Phenylalanine,G=OlycmcH=Hislidme, 

I l=lsoleucine.K=Lysine.L=Leucm| 
M=Methionine, N=Asparagme, I^Prolme. 

(M}lutamine, R=Arginine, S=Senne, 
T=Threonine, V=Valine, W=Tryptopton, 
Y-Tyrosine, X«Unknown, *«Stop codon. 
/=possible nucleotide deletion, \=possibte 

i nucleotide in sertion j . 



^SSwDHRHAPPHLAHFCIFSRDGVSP 




i SAPPALLQDTSV 





rcJXSAAGFQPHKGSGnQTWRYGQW 

njKmQDCLAISGVRWVLFGymQSGC^^ 

KSS^KVKSLLWGQGFPDYVA 

KvSwTKYGSLLNPQAKIVNWAI;«J 
NTfHKPPCPATKRRHSOSl 




CPANrCUU/i-Ti.viiiNj' • — < 
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SEQ uJ 
NO:of N 
nucl- pc 
eotide se 
seq- u< 
uence 


IQ ID Met 
D: of hod 
:ptide 

snce 


SEQ ^ 
ID "NO: be 
in tu 
USSN Ic 
09/496 a 
914 n 
a 
n 
P 


edicted 1*1 
sginning ni 
icleotide 16 
cation c< 
jrrespondi tc 
gtofirst a 
tnino acid 0 
ssiduc of » 
eptide 

equenoe __ 


edicted eofl A 
icleotide I> 
cation ^ 
irresponding 1= 
> last amino N 
sid residue Q 
fnentide 1 

squencc ^ 


5ino acid sequence (A-Alanme t>C^stelni 

aAsoartic Acid, E=Glutaraic Acid, 

=Ph6nylalanine, G=Olycine, H=Histidmc, 

=Isoleudne.K=Lysine,L=Leucme, 

I=Methionine, •N=Asparaginc, P-Proline. 

NGlutamlne, R=Arginine, S=Serine, 

•^Threonine, V^Valine, W^Tiyptophan, 

'"Tyrosine, X=Unknowii, *«Stop codon. 

^possible nucleotide deletion, >Fpossiblc 

ucleotidc insertion 1 

/WGTGDR'IERLQSQVLQAMRQRCjIAVEVQ 










I 


OTNACATFNFLCHEGRVTGAALIPPPGGTSL 
SF^SRSVAQAGVQWKULCiSLQAPPF W 1 


[ 590 i 


940 


■^■4492 




[72 i 


SSS/GDPFTSASQSAGrrGVS^^ 

S>Seprrkcgnavvwpstslgdhrvts 


591 


1941 A 


4495 


1444 




LKDNWVEDTIPQGA VKU 1 ai 1 
ppumgRrFPHPSSGV J 




I9S 


"^"4496 


2 


919 


JS^SdSavpiatysvvqlrrqrpdf 
Kvcwpv>maswAKKQAGinvi^ivm 

SALPAVOWHOTSEOTGCmm^ 
GLGFGVCFLLLVOGSVAMGVICTAIALFQTL 

SoVGRQAbl^ 


593 


1943 A 


4506 


2 ' 


193 
"647 


mSTTOCAVKPSVTCVDQDFKSQOffn 
-S&?SSI15^AlSAPO^ 


594 


1944 A 


4507 


1327 




595 


1945 A 


4512 


533 


264 


SPASASRV/AGWGAWmiQ^LVQMRVH 
^r^"^r.^ ni I /N1.MIHPPRSFKVLGLQA_J 


596 


1946 A 


, " 4513 


3 


j 1674 


HSVSSTNGHRWQIFQDWLGTDQDNLDL^ 

SSlvqkkkqiWshaaqlqiu^k 

DFNLS?KaASGSDDAKVKLWSTOLDNSV 
^SpA^^CmSPSSRmAFGCADHCV 

^^SSSrkavsyakfvsg 


597 


1947 


A 4518 


" 536 


■ 824 




598 


1948 


A 4524 


1 


384 





214 



wo 01/57188 



PCTAJSOl/03800 



"SEQtD b 
NO: of 1 
nucl- F 
eotide £ 
seq- I 
uence 


EQID ^ 
40: of t 
peptide 
»eq- 
lence 


Act S 

LOd I 

i 
I 
C 
c 


EQ I 
DNO: t 
n r 
JSSN 1 
)9/496 ( 
)14 I 
■ 


>redicted 1 
jeginning i 
luclcotidc 1 
ocation ( 
xjrrespondi t 
ig to first 1 
amino acid 
residue of 
peptide 
sequence 


Predicted end 1 / 
lucleotide 1 
ocation ! 
;orresponding 
0 last amino 
Kid residue 
[>f peptide 1 
sequence 


^mino acid sequence (A=Aianme OCystemc, 

'^=>A«nnrHc Acid. EpGlutainic Add, 

'^^Phenylalanine, G=01ycinc, H=H!Stidinc 
=Isoleudne, K=Lysine, L^Leucine, 
Vl=Methionine, N=Aspaiaginc, P=Pit>line, 
C^lutamine, R=Arginine, S^Serine, 
T=Threonine. V=Valine, W=Tryptophan, 
Y«=Tyrosine, X^Unknown, *'=Stop codon, 
^=possible nudeotide deletion, \ppossible 
nucleotide insertion 














RRREMQSQSVMLALRRGDAV WbLsriuHbO 
vr A VQ>JWf;Tf VTTFSG"FLVYPDLAPAAPPGLG 


599 


1949 


A 


4526 


366 


776 


MGQPAPYAEGPIQGGDAGELCKCDFLVri sr 

npeavceagtpamfqtawrqmescsi/aqag 
vqwrdpgslhppplgfkrfsclslpsswdyk 
happhpanfcifsrdqvspcwpgwsrsldlvi 

PPPWLPKVLGLQA 


600 


1950 


A 


4529 


776 


334 


FFFErESCYVAQAGVQWCDLCSLQAPPPO\55 
DPPASASRVAGTTGARHHTQLIFVFLVETGFH 
\MLARDGLKLLTSSDPPASASQSSWDYRREPP 
RLANFFVFLVETGSRYVAQAGVQWLFTGAIP 
T T JSTGVLTCSVSDLGRFTPP 


601 


1951 


A 


4533 


1460 


403 


■"HEVQESMFLESEFSRCjiSUNY iLAUl iALbb 
VGSPKAKEALNMLTWRAEQEGGMQFWVSSE 
SKLSDSWQPRSLDIEVAAYALLSHFLQFQTSE 
GIPIMRWLSRQRNSLGGFASTQDTTVALKALS 
EFAALMNTERTmQVTVTGPSSPSPVKFLIDT 
HNRLLLQTAELADGTANGSV/SISANGFGFAI 
CQLNWYNVKASGSSRRRRSIQNQEAFDLDV 
AVKENKDDLNHVDLNVCTSFSGPGRSGMAL . 
MEVNLLSGFMVPSEAISLSETVKKVEYDHGK 
LNLYLDSVNETQFCVNIPAVRNFKVSNTQDA 
SVSIVDYYEPRRQAVRSYNSEVKLSSCDLCSD 

VORLPSL ^ 


602 


1952 


A 


4540 


1963 


295 


MRAPGRPALRPLPLPPLLLLLLSSPWGRAVru 
1 VSGGLPKPANITFLSINMKNVLQWTPPEGLQG 
VKVTYTVQYFIYGQKKWLNKSECRNINRTYC 
DLSAETSDYEHQYYAKVKAIWGTKCSKWAE 
SGRFYPFLETQIGPPEVALTTDEKSISWLTAP 
EKWKRNPEDLPVSMQQIYSNLKYNVSVLNT 
KSNRTWSQCVTNHTLVLT W\U3ri> 1 1 ^ V ri V 
ESFVPGPPRRAQPSEKQCARTLKDQSSEFKAK 
nFVmOJISrrVFLFSVMGYSIYRYIHVGNKEK 
}fi^ANLILIYGVNEFDKRFFVPA\EKIVUNFI\TL 
NISVDDSKISHQDMSLLGKSSDVSSLNDPQPSG 
NUtf PQEEEEVKHLGYASHLMEIFCDSEEKn 
EGTSFTQQESLSRTIPPDKTVIEYEYDVRTTDI 

1 ^ A ^T%r3-arMfX CT AWPV^TOriTT 1 .ESOAALAVL 

1 CAGPBBQ""^*l-*v*^»^ " " * ^ *- 
GPOTLQYSYTPQLQDLDPLAQEHTDSEEGPEE 

EPOTLVDWDPQTGRLCIPSLSSFDQDSEGCE 

PSEGDGLGEEGLLSRLYEEPAPDRPPGENETY 

TMOFNfEEWGLYVOMEN 


603 


.1953 


A 


4543 


3 


600 


I YSAVEFVEQASGISDWWNPALKKRMLSDSOL 
GNflAPYYEDSDLKDLSHSRVLQSPVSSEDHAI 
LQAVIAGDLMKLIESYKNGGSLUQGPDHCSL 
LHYAAETGNGEIVKYILDHGPSELLDMADSE 
TGETALHKAACQRNRAVCQLLVDAGASLRK\ 
TDSKGKTPQERAQQAxGDPDLAA/ 1 1 iiioK.viN 

YKVIGHEDLETAV 


604 


1954 


A 


4548 


3 


938 


1 ODNKVQNGSLHQKDTVHDNDFEPYLlugAN 
OSNSYPSMSDPYLSSYYPPSIGFPYSLNEAPW 
STAGDPPIPYLTTYGQLSNGDHHFMHDAVFG 
OPGGLGNNIYQHRFNFFPENPAFSAWGTSGS 
OGOOTQSSAYGSSYTYPPSSLGGTWDGQPG 
FHsSksKAPGMNSI^QGMVGUaGDVSSSA 
VKTVGSWSSVALTGVLSGNGGTNVNMPVS 
KPTSWAAIASKPAKPQPKMKTKSGPVMGGG 
1J>PPPIKHNMDIGTWI>NKGPVPKAPVPQQAP 
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SEQ ID 1 
NO: of N 
nucl- P< 
eotide si 
scq- u 
uence 


£Q ID Met 
0: of hod 
3ptide 
jq- 

OICC 


SEQ Fi 
ID NO: b( 
in ni 
USSN Ic 
09/496 0 
914 n 
a 
r 
P 


redicted y 
^ginning o 
jcleotide Ic 
)cation c 
oncspondi t( 
gtofirst a 
mino acid o 
esidue of s 
leptide 
equence 


redicted end a 
iicleotide E 
>cation F 
onresponding 1= 
> last amino ^ 
cid residue C 
f peptide ^ 
equence ^ 
/ 
r 

—————— -j 


Sino acid sequence {A^AlaBinc u=uysiemc, 
^Aspartic Acid, &=Glutanuc Acid, 
-Phenylalanine, G»Glycine, H«Histidme, 
=Isoleudnc,K=Lysine,lM^cine, 
1-Mcthionine, N=Asparaginc. P=4>ioUiic, 
)=Glutainine, R=Arginine. S=Serme» 
'=Threonine, V«Vatine, W-Tiyptcphan. 
f=Tyiosine, X^Unknown, ♦«Stop codon, 
=possible nucleotide deletion, V=possible 


605 1 


1955 A 


4553 : 


I 


1304 


tKaALTUCSLQLYVCSVTPQLQEELLGIAQIN 

^qSaSwqadsvdsgcsnctqtsppy 
^Sdsiujhptaaqagpyspekfqpspl 

iffiroAQSGQrDLPPWVLRDERLRGLLR 

S^qSyrheqaaek^ 

PADDV - '■ 1 

"^,SGPGPAPFLAFVAAPVGGISFHLQIGL^>Klir 




T956 "A 


4555 


^429' 


"776 


S^OSVKHTKRKSSTVMKEGWVIVH^ 
SLsEnSvKTSALIPNGANPHCFErrrANV 

KL^OFARflGEB^SFRRSWGTPAYLAPEVL 
- TRP\mUtASERPSAPSAMAmRGPOKK>^ 


607 


1957 


A 4563 


1 

_ 


4499 


^^^^^^ 
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Amino acid sequence (A=Alamne C=cysteii«i, 
D=Aspartio Acid, EpGlutamic Acid, 
F=Phenylalantae, G=Glycine,H=Histidme, 
I=lsoleucine, K=Lysine, lj=Lcucine, 
M=Meaiionine,N=Aspaia^e, P=PK)line, 
n=Glutamine, R=Arginiiie, S=Serine, 
T=Threonine, V=Valine, W=Tiyptophan, 
Y=Tyiosine, X=Unknown, *=Stop codon, 
)t=possible nucleotide deletion, V^wssible 
nucleotide insertion 



n ucleotide insertion =rnss!SrrSi— 

■HmQIETLDPUSbX}HlPKLElU>LHha«UlHL 

VPGTFNHlXSMKW.RIJDSbmJiCD^W^ 

DLLKTYAESCJNAQAAAlCEYPRMQOMVAn 

TPEELNCBRPRlTSEPQDADVTMOT^n|Tr^ 

ACfiMPTfPFtTWLRNNNELSMKTDSRLNLLDD 



GTLMlQmX}ETDQGIYQCMAKNVAiJiiv*.iW 
ECSATGHPPPWSVmODRTPU'VDreVKrrPS 

Sqnwqgdsgeyacsatnnidsvhata 
SKftvtpqdrvviegqtvdfqceak 

S^iSmGGSQLSVDRm 
?aJS.roQGQYECQAVNnGSQKVVAHLTVQ 

I^^^^aSdttvevganvqlpcssqotp 



EPArm«IX5VQVTESGKFfflSPEOt'l-li«uv 

OT™K^EwWnGSASVSMVL^^ 
VSM^GDPFVATSWEAIATVDIUUNSTOTHLF 

P/A.iAippcmOHCSNVCSNDPPCFSVMIPPN 







^.^o^''^^^ 1 TnsOKVLACGLNEFNKL 




SSgINLLGGPU3GKQVIRVSCGD 
™AinnNTbAWGNGGNGRI^UvrrPTERP 



GGPDAW 



C C PD AW — — — — 
FFrofra^SVAQAGVQWmSS^ 

KLSFHVLSGS 



T.VIRPPRPPKVLGLQA 
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"SEQID S 
NO: of > 
nucl- p 
eotide s 
seq- t 
uence 


EQID N 
10: of h 
eptide 
eq- 
ence 


dct S 
od D 
11 
I 
C 
5 


EQ 

DNO: 

a 

JSSN 
19/496 
)14 


Predicted i 
beginnmg t 
nucleotide .1 
location < 
correspondi t 
ng to first i 
amino acid 
residue of 
peptide 
sequence 


Predicted end ^ 
lucleotidc 1 
ocation 1 
jorresponding 1 
o last amino i 
icid residue ' 
»f peptide 
sequence 


Gnino acid sequence (A^Alanme OCysteme, 
>-Aspartic Acid, &=01utamic Add, 
-Phenylalanine. G=<}lycinc H=Histidiiie, 
=Isoleucine. K=Lysine» lJ=Leucine. 
Vl=Methionine, N=Asparagine, I^Proline, 
5=Glutamine, R^Arginine, S=Serine, 
r=Threonine, V=Valine, W^Tiyptophan, 
Y^Tyrosine, X-Unknown, *«Stop codon, 
^possible nucleotide deletion, \r=possible 

nucleotide insertion 

ADPHTTVIRFFPAASATKRVLPPVLRVSSKKl 

WNPNVPESPRIPAPRLPKRMSGAFrAGAALM 

LCAATAVLLSAQGGPVQSKSPRFASWDEMN 

in Auni 1 AT nAr;\PA>JT\OAHPOSAERAGA\R 

LSACGSACQGTEGSTDLPLAPESRVDPEVLHS 

T oTf-vi vAmsKiPinni FHKVAOOORHLEKQHL 

RIQHLQSQFGLLDHKHLDHEVAKPARRKRLP 

EMAQPVDPAHNVSRLHRLPRDCQELFQVGER 

OSGLFEIQPQGSPPFLWCKMTSDGOWTVIQR 

«t 7T-i/->o\rrvcxn> PWP A VTf AOFODPHGEFWLGL 

EKVHSITGDRNSRLAVQLRDWDGNAELLQFS 
VHLGGEDTAYSLQLTAPVAGQLGATTVPPSG 
LSVPFSTWDQDHDLRRDKNCAKSLSGGWWF 
GTCSHSNLNGQYFRSIPQQRQKLKKGIFWKT 
WRGRYYPLOATTMLIQPMAAEAAS 


"en 


1961 ; 


S ' 


♦571 


25 


T396 


612 


"1962 


A 


4575 


162 


3 


FFFETESRSVAQAOVQWRDLSSLQPPPPCiXSK 

n^PASASPVAGITGTRHHRTRG 
PLAQRRPFLWVTVKTOGHlWGSSTYPHfWus 


613 


"1963 ■ 


A 


4584 


687 


321 


SNS/PASASQVAGPNARHQARIIFVFLVEIOT 
HHVGRAGlAjFUNLAiULrr i^rirR. v iajij v^^*^ 
1 MfCPHPAHKYlSMIQFNVHFMCas4SVHIYI 


1 614 






4589 


727 


299 


I'GSAQSAQRGRGRiaOflJIAGSAlVllMYsrMu 

T^/^AiiA/z^Tn T WAA/fATDHWMOYRLSGS 
GGLFCAWVGi iLLV v AMAii-'n wiwy i ru^ww 

FAHQGLWRYCXGNKCYLQTDSIAYWNATRA 
FMILSALCAISG^MGIMAF/GWVAVL^f^FFA 

GIFYMCAYRVHECRRLSTYR 
-XuJEiaQAWA^QaPQS^^ 


i ct < 


"1065 


"a 


4590 


"T2 


■414 


r"T'/-« n T D rM-ivQCP VHR P BTHSPLGAAWEVADFO 

PEQVETQPRWSREEPGSLHSGHQEQLNRKR 

ERRPLPKNARPSPWVPALADEWNTLHQEVTT 

TRT.PAGSOEPVKD 


616 


"1966 


A 


4592 


773 


488 


DFALVAQAGVQWHNLGSPQPLPPGFKKfS^L 
SLPSSWEYRCVPP/RLANFVFLVEMGFLHVGQ 
AnT.F.TPTSGDPPALASOSAGrrGVrrVPSGPG 


617 ■ 


-19^7 


B 


4595 




84 


478 


XRHGLREPLLERRCAAASSFQHSSSLGRbLi'Y 
DPVDTEGFGEGGDMQERFLFPEYILDPEPQPT 
REKQLQELQQQQEEEERQRQQWIEERRW^ 
RARSREHPWGHPDPALPPSGVNCSGCGAEL 

HCODAR* 


618 
1 619 


"1968 
1969 


A 
A 


4596 
4601 


2945 
2 


1188 
357 


■ ARSRNSARGVYGMCVPlU'LCFJLlsULbKl-jDU 
SAERPYFMCSTLKKPLARRCFPAIHAYKGyL 
MVGNETTYEDQHGSRKNITDLVEGAKKANG 
VLEARQLAMRIFEDYTVSVmmiGLVIAMA 
MSLLSnUiaLAGIMGWVMIIMEI\SELGYRIF 
HCYMEYSRLRGEAGSDVSLVDLGFQTDFRV 
YUlLRQTWLAFMIILSILEVIItt.LLIFLRKRILI 
AIALIKEASRAVQYVMCSaYPLVTFFLLCLCI 
AYWASTAmSTSNEAVYKIFDOSPCPFTAKT 
CNPETFPSSNESRQCPNARCQF AFT uuiiau i n 
RALLGLQIFNAFMFFWLANFVLALGQVTLAG 
AFASVYWALRKPDDLPAFPLFSAFGRALRYH 
TGSLAFGALILAIVQIIRVILEyLDQRLKAAEN 

KFAKCUtfTCaJCCCFWaEKFIKFUTO^ 

lAIYGTNFCT^ARNAFFLLMRNIlRVAVLDKV 

TDFIJLLGm.IVGSVGnJVFFITPTHRIRIVQDT 

APPlJmWVPILTVIVGSYLIAHGFFSVYGMC 

VDTLFLCFLEDLEKNDGSAERPYFMSSTLKKL 

1 NKTNKKAAES 
- -RT^vj^VlLGEFyRKl^NNTKSrVKreYKAiiiif 
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SEQID 2 
NO: of 1 
nucl- I 
eotide i 
seq- ' 
uence 


JEQID 1 
srO:of i 
)eptide 
»eq- 
lence 


Act S 
lod I 
i 
1 
( 


;eq f 

DNO: I 
n I 
JSSN 1 
39/496 
?14 


>redicted I 
)egirmmg i 
lUcleotide 
ocation 
:oiTC5pondi 
ig to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end i 
lucleotide 
ocation 
:orrcsponding 
to last amino 
acid residue 
of peptide 
sequence 


^^ino acid sequence (A=Alanme C>=<Jysteine, 
>»Aspartic Aci4 &=<jlutamic Add, 
^^Phenylalanine, <j«=GIycinc, H=Histidinc, 
NIsoleucine, K=Lys!ne, l/=Leucinc, 
M=Methionine, N=Asparagine. P«-Pfoline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tiyptophan, 
Y=Tyrosine, X=Unknown, *«Stop codon, 
/^possible nucleotide deletion, \=possiblc 
nucleotide insertion 














GLAYGHFSYEFSNHRDVWDLQGWVTGNUJfL 

GLrn^TDPQIHSVDQKVFTTNFGKRGIFYFFN 

NOHVECNEICHRLSLTRPSMEKPCKS 


620 


1970 


A 


4606 


1 


2415 


■merlwglfqraqqlsprssqtvvqrvegpr 

KGHLEEEEEDGEEGAETLAHFCPMELRGPEP 

lgsrprqpnlipwaaagrraapylvltaluf 

TGAFLLGYVAFRGSCQACGDSVLWSEDVN 

yepdldfhqgrlywsdlqamflqflgegrl 

EDTIRQTSLRERVAGSAGMAALTQDIRAALS 

RQKLDHVWTOTHYVGLQFPDPAHPNTLHWV 

DEAGKVGEQLPLEDPDVYCPYSAIGNVTGEa- 

VYAHYGRPEDLQDLRARGVDPVGRUXVRV 

GVISFAQKVTNAQDFGAQGVLiyPEPADFSQ 

DPPKPSLSSQQAVYGHVHLGTGdPYTPOFPSF 

NOTOFPPVASSGLPSIPAQPISADIASRLLRKL 

KGPVAPQEWQGSLLOSPYHLGPGPRLRLWN 

NHRTSTPINNIFGCIEGRSEPDHYVVIGAQRDA 

WGPGAAKSAVGTAILLELVRlro»MV&Nvjr«. 

PRRSIXnSWDGGDFGSVGSTEWLEGYLSVL 

HLKAWYVSLDNAVLGDDKFHAKTSPLLTSL 

IESVLKQVDSPNHSGQTLYEQVVFTN\PSWD\ 

AEVIRPLPM\DSSAY\SFTAFVGVPAVEFSFME\ 

DDOXAYPFUiTKEDliJbNunK v Lti^unA^rrv y «. 

OAVAQLAGQLLIRLSHDRLLPLDFGRYGDW 

LRHIGNLNEFSGDLKARGLTLQWVYSARGDY 

IRAAEKUIQEIYSSEERDERLTRMYNVRIMRV 

EFYFLSQYVSPADSPFRHIFMGRGDHTLGALL 

DHLRIXRSNSSOTPGATSSTGFQ>ESRFRRQL\ 

AmTWDACKGAANALSGDVWNIDNNF 


621 


1971 


A 


4610 


793 


334 


ISRVDDFVGSGIANVIlAVAIFSU'AhAJfcU-vi<.u\ 
NTLVLKQQTFIESARSIGASDMTVLLRHILPGT 

GSSIWFl* iMKiu 1 ailoA/VoLiDr j-f\Ji-»vj^v<A * * 

EWGAMLNEARADMVIAPHVAVFPALAIFLTV 
T , AFNLLGDGLRDALDPKIKG 


622* 


1972 


A 


4614 


2 


820 


TVYVMIAIFCIASAMSLYNCLAALlHKIPYCjg 
CnACRGKNMBVRLIFLSGLClAVAWWAVF 
KNEDRWAWILQDILGIAFaNLlKTLKLPNFK 
S(r»m.LGLLLLYDVFFWlTPFITKNGESIMVEL 
AAGPFGNNEKNDGNLVEATGQPSAPHEKLPV 
i/n>\n>in TVP«:VMSVGLMPVSILGFGDnVPGL 
LIAYCRRFDVQTGSSYIYYVSVXTVAYAIGMIL 
TFVVLGVLMKKGQPALLYLVPCTUTA/CQFV 
AWETVREMKKFWERVTS 


623 


1973 


A 


4619 


17 


" 691 


' TLVSWEFVRRADLlTUiDLAPSSVDSGgAU^ 
GOCCESGLPNTMPSAFSVSSFPVSIPAVLTQT 
n\)m?PWLMGLATFHALCVLLTCXSSRSYRLQ 
IGHFLCLVILVYCAEYINEAAAMNWRLFSKY 
OYFDSRGMHSrVFSAPLLVNAMirVVMWVW 
KTLNVMTDLiusAl^fcKKKJiisJSJuw^ vjjv\ 
AAWSLRPSRPPSAAPSAAVCVAWASFQLTHG 

LKNRCFI 


624 


1974 


A 


4622 


164 


668 


- vSCYTALQSIMNQPHSANUPEPLCAVCUgAk 
SLEENHFYSYPEEVDDDLICHICLQALLDPLD 
TPCGHTYCTLCLTNFLVEKDFCPMDRKPLVL 
OHCKKSSILVNKLLNKLLVTCPFREHCTQVL 
QRCDLEHHFQTSQAWGTHL^SQLLGRLRQED 
n.SPGVHHCSEV 


625 


1975 


A 


4625 


474 


473 


- -CTLSPSPaPPLaSSSiSSPSFPLFi'Pi'llXP^iU' 
PPLLIPSS*LSP 
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SEQ ID S 
KO: of > 
nucl- P 
eotlde s 
seq- V 
uence 


BQID 1 ^ 
10: of h 
eptide j 
eq- 
lence 


let S 
od r 

ii 
I 
C 
5 


EQ P 
DNO: b 
1 n 
JSSN 1 
9/496 c 
114 r 

r 
I 


redicted 1 P 
eg^nning 1 n 
udeotide 1 
Dcation c 
onespondi t 
ig to first i 
iminoacid ( 
esidueof i 
peptide 1 
sequence 


redicted end / 
ucleotidc 1 
Dcation ^ 
^rresponding 1 
0 last amino 
Lcid residue 
)f peptide 
sequence 


imbo acid sequence (A»Aianmc i^y sreroi^ | 
>»Asnartic Acid, E=01utamic Acid, 
^^Phenylalanine, G=Glycine, H=Histidm^ 
=Isoleudne, K=Lysine, LpLcucine, 
Vl=Metliionine, N-Asparagine, P«Pioline, 
5=Ghitamine, R=Arginine, S=Serinc, j 
r=Threonine, V=Valine, W=Tiyptophan, 
Y»Tyrosine, X=Unknowa, ♦^Stop codon, 
(=possible iwclcotide deletion, V=possible 
nucleotide inscition ^- 


626 


1976 


k 


1629 


249 


3 


klXGNECFCYHCNVClFLMlKJC*GLI'LC*im 
T i7FFT*SHSFmECSGTISAHCSLQLQGSSNSP 
ASASOVAGIAGTHH .\ 


627 


1977 


A 


4635 




301 


FFFFETKPFFAPQAOGQGPSRUSLNPLFlGLk 
OFSGLTLSRSGNNGPRPPPRVNFGILRGNGVP 
PGGAG*PRPPDLRGPPGLAPPQGGNNGGDPP 


628 


1978 


A 


4648 


1357 


782 


KLFSSORLFGPHIQAINPSFLLLSFFPS*LLAMR 
TVGTWAFILmVYRIVLLLF*HV*PAYFQPSK 
NKTAKINCN*RPFLFLVCYLL*AELHIGIFIANF 
YDCIPNKLNEHLWPKLUJSLIFHVDFCGFUK 
VFYICFTEFIXFLYFL*LFIIKVSCSn*CSTlCVF 

S YKSFAVIIFh VDKi Kf f —I 


629 


1979 


A 


4660 


18 


999 


HHELHTLELLQNPKEVLTRSEIQDVN V SLhAV 
KVKTVCQffLMKEMlJKRFQVAV^n-AEDTA^ 
PKLVFSQEGRYVKNTASASSWPVFSSAWNYF 

AGWRNPQKT AFViiKr i^m-ov^ v ^^^^ VJLl W** ! 

KHYVi^VESRDSLEVAVGVCREDVMGnraS 

KMSPDVGIWATWSMGYWPUtarOTPTXJQ 

EPALmVGVYLDRGTONVSFYSAVDOVHLH 

TFSCSSVSRLRPFFWLSPLASLVIPPVTDRX*G 

FSSPDQNSFPVVQLRimffWAUCTSCLVPG 

WSIFWVSLTVPFGICPLCASQEAVPWEVGLA 

NGDGTuNr rKKr wniri-> 1 


630 


1980 


A 


4669 


2 


358 


-FFFFFETESHSVAQAGMQWRNUiSLPAFmit 
TPFFCLSLLNOWDYRW'IimANFFVLL>nSTG 
FiroVGQDGLDU.TS*STPSASQSAElTGVSHC 
TRtK-KTRFAKGHVEFFFESHVE 


631 


1981 


A 


4674 


953 


"614 . 


•n-IROTDDKHliliCl VQJimGKNlWbAV 

AHTCNPCTLGGRGRWmOSGVQDQPGPTWQ 

NPVFLERRPRAIBSSPQLTTQRILWAQGLWV 

nAfiSTQCSROPROEOVFREO 

■ RSTHASGMlSPSFGFMGHaWJEFIilLreimir 


^632 




TA 


"4678 


"34 " 


pii ' 


ilpSVQGEAAGSSLlSHLQTFSPDUCGVYCTFP 
Asr.T.APVPTHWTVSELSRSPVATATFC 


633- 


1983 


A 


4696 


1 


1365 


■ R^LGMEGERRASQAPSSQU'AUOA^KibM^W 
QAPFPGSSGSSALLQAEVLDIDEDEDDLEVFS 
KDASLMDMNSFSPMMPrSPLSMIN«^^EP 
nr vni TjrrvnFPFSHV'ri'lEl'FITyRHTKTSRG 
EFDSSEFEVRRRYQDFLmiWKL^EAOTT^^ 

gpgllsrmgqtvravassmrgvknrp™ 

EMNmELFSQKINLIDKISQRiyKEEMYroE 

mkeygpihlwsaseedlvdtlkdvascidrc 
ckatekrmsch^eallpwheyvlysemlm 

GVMKRBDQIQAELDSKVEVLTyKKADTOLL 

peeigkledkvecannalkadwerwkqnm 

rtwnTKT AumMAEENlHYYEQCLATWESFLT 
- -^^^TTTOfeVlLlDPl-HKAlRRl^PUly 


634 


1984 


A 


4708 


421 


""158 


WlSKAVYXHREMCCa-TSTGRKSHGLEKDRM 
,^HATfiOSCRAA*RRRKTLQF?CYH 


635 


1985 


A 


" 4709 


42 


341 


--mOPDAKKmTVHWKKKliafcASbllU^^Sl 
PG\^APGHVra)yGRGDNFYlJH*DPGOIVL 
V^^^^FIWIARK^^TlX3EHHBYLffiVFW^^ 

SED 


636 


1986 


A 


4721 


2 


351 


RSTIVrVA*lAIQKIjn.SMNDAy«VIMKMSS 
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NO:of N 
nucl- 1 p 
eotide s 

^ea- i u 

i I 
uence 


EQID N 
0 : of h 
eptide 
:q- 
ence 


let S 
od I] 
ii 
I 

C 
S 


EQ P 
ONO: b 
I n 
JSSN h 
9/496 c 
^14 n 
a 
r 
I 


redicted | P 
Bginning 1 n 
ucleotide l( 
>cation 1 c 
oirespondi t 
g to first a 
iminoacid j c 
esidueof s 
>eptkle 1 
sequence 


redicted end 
ucleotide 1 
jcation J 
orresponding 1 
0 last amino I 
cid residue < 
►f peptide ^ 
,equence 

/ 


Snino acid sequence (A=Aianine G>=Cysteme, 
)=Aspaitic Acid, EF<31utamic Acid, 
^Phenylalanine, G=<51ycine, H^Hisddinc, 
==Isoleucine, K=Lysine, L^Lcucine, 
4-Mcthionine. N=Aspaiagine, P=Pn)line, 
5=Glutai]iine, R=Arginine, S^Serine, 
r=Threonine, V=Valine, W-Ttyptophan. 
/^Tyrosine, X=Unknown, *=Stop codon, 
t=possible nucleotide ddction, \«possible 

aucleotide insertion 

ISPNFNSMDQPLDFQRTLGLRSPCYNRVPAgK. 

K iivrmKnsmN A YOVDSVOST 


raT 


1987 


A. 


4726 ( 


564 T 


253 


NTGLTCSIQRKCGETQLYRREENRl-lLLLvyn 
LKJSESFQVLTLSPRLEFSGLISAHCNLRLPGSS 
DSSASSSRAAOrrGVHHHAWLIFFFl^^ 
UAO* A GLELLTSGDPPASASRSAGITGVSHHA 

RPRFTRFL 




1988 


A 


4734 


24 


592 


GGMDSRVSGTTSNGETKl'VVl'VMEKmUO 

TLERGHWNNKMEFVLSVAGEUOLONVWRFP 

YUrnO^GGGAFFIPYLmFTCGIPVnjjnVU^ 

GOyreOGGVTAWRKlCPIFEOIGYASQMIVn. 

. vkn/vrnn AWAi FYI FSSFTIDLPWGQCYHE 

^T.rn^'-'^l'nKTNGSLNGTSENATSPVIEFW 


639 


1989 


A 


4743 


1040 


699 


AGLELLGSSNLPAAMVSQSAQnGHDHCAWA 

Mr^-vnwrT l/Ti^CrtT OPf^TfPf^ 

TSNHVLYi QliOLKK.VJJvCvr 


• 640 


1990 


A 


4771 


527 


2 


OTPQLKABLLRQPPK'LOLQMESHSCOTAJ^ 

;MRSiaTAmSQVQAIIXPQProiTOS^ 
PDHEQQPLSWVLPPPQKDMNPREQQVALGP 

r^A A AT PWAVWRNDCFPR 

L^gS&mssEi^smPAR- 




"1991 1 


A 


4780 


16 


473 


LQLAASPYFSPSWAECPQPVPAGTHATWCIA 

RVWARMTPPGPAOIPSmPPPPPERSVPIPSP 

FPARDSGSRQGHSTDRYKHTDAPRDAHRRVP 

<^T»T^'TM^Tr•^^4TO^^THTHAHTPPEK 
ORDTDTvj V rl l xjj^J i n i * — - 


642 


1992 


A 


4798 


1 


487 


-"GYSFRCDlVDYSRSPTALKMARICWLYYft^K 
FlELLDTlFmRKJCNSQVTFLHVFHHTIIVffW 
TWWFGVKFAAGGLGTFHALLKTAVHVVMY 
cwm <5AT rtPAYOKYLWWKKYLTSLQLVQF 
VIVAIfflSQFFFMEDCKYQFPVFACnMSYSFM 




643 






"4799 




"191 


OEIPLSEILRISSPRDFTNlSQGSNPHCFEirrDT 
MVYFVGENNQDSSHNPVLAATOVQLDVAQS 
WEKAKQALMPVTPQASVCTSPGQOKDHSK 
n^ASvrrePGOQKDHSKQ 


644 


T1994 - 


A 


4800 


488 


101 


- AmFAVHPVHlBCVAOVVURAYlXCAU;t'iL 
CsFLGYCKAHlESNKEGAHSSlFWVmi^ 
AVAMLCKEQGITVLVRAATWLOPAKVCPFP 

SVimnvoWPCLCGVLHAYIPLLV _ 
" TOmVLCOTPARPQSl MIHLGHDU'LLLi.r V 


645 


1995 


A 


""4805 


458 


126 


AAAOTTPGERSSLPAFYPOTSGSCSGCXJSLSL 
Ar>F.nnKVYINMPORO 


646 

647 


1 1997 . 


A_ 


4817 
1 4854 


47 

|.1044 ' 


1033 
1 335 


- ToOToSlFLa^iSQGVP^^^E^ 

t npOAPGHDMTSIPFPGDRLLQ'VDGVILCGLT 
IKOAVOCLKGPOQVARLVLERRVPRSTQQC 

DGPKF*SSN*KRIANGLQFSFVQMEKESCSffl, 
KSDLVRKRLFPOHPABENGAIAAGpm^RE 
WEGPRKASSSRCRGSWAMQLSVQAGP^AS 
YYPAAVEVLHLLRGAPQBVTLLLCRPPPGAL 
PELEOEWQTPELSADKEFTRATCTDSCTSPIL 
GSRGQLGGTVPPQMQGKAWGLRPESSQKAIR 

pnTunAKTRRDLOPVP ■■ 

— gRVRGDWPLKKKKSNSNlHPlPSWCOSlUSKD 
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SEQ ID 5 
NO: of > 
nucl- P 
cotidc s 

uence 


EQID N 
fO: of h 
eptide 
cq- 
ence 


let 1 S] 
od II 

1 ^ 
U 

0 

9 


EQ P 
)N0: b 
I n 
SSN \ 
9/496 c 
14 r 
a 
I 
I 

! 


redicted p 
egiimisg n 
ucieotide 1 
^cation c 
oirespondi t 
Igtofirst i 
iminoacid 1 ( 
esidue of i 
>eptide 1 
sequence 


redicted end 1 / 
ucieotide 1 
nation ^ 
orrcsponding 1 
0 last amino ] 
Lcid residue * 
>f peptide ' 
sequence 


finino acid sequence (A=Aiaiune OCysteine» 
>=*Aspartic Aci4 EHjlutamic Acid, 
"Phenylalanine, G=Glycine, H«Histidinc, 
=:Isoleucinc, K=Lysinc, L=Lcucinc, 
vI=Methiomne, N=Asparaginc, P=Pioline. 
5=Glutainine, R=Arginine, S=Serine, 
r=Threonine, V=Valine, W=Tryptophan, 
y=Tyrt)sine, X=Unknown, ♦=Stop codon, 
(^possible nucleotide deletion, \ppossible 
nucleotide insertion ■ — 














IVMPTYDLTDSVLETMGRVSLDMMSVqAiN i 

GPPWESKNSTAVWRGRDSRKERLELVKLSRK 

HPELroAAFTNFFFFKHDENLYGPlYKHISFFD 

FFKHKYQINIDGTVAAYRLPYLLVGDSWLK 

QDSrnnEHFmELQPWKHYIPVKSNLSDLLEK 

LKWAKDHDEEAKKIAKAGQEFARNNLMGD 

DIFCyyFQTFPKNMPIYK ^^^^^^^^^ 


"648 


1998 


A 1 
1 


5167 


2030 1 


837 


AGMLPA » trrrkxrT'Trr/^'vrr nA/^VTTT t.NYI 

SEEEEKSGLGAKIPVTirrGYlAjAuK.i ii^i^in xi 

LTEQHSKRVAVILNEFOEGSALEKSLAVSQG 

GELYEEWLELRNGCLCCSVKDNGLRAIENLM 

0KKGKFDYIIXETTGLADPGAVASMFW\n3A 

ELOSDrVlIXjlITIVDSKiGLKJU-Atuira-iJvji-i 

NEATRQVALADAILINKTDLVPEEDVKKLRT 

msmGLGQILETQRSRVDLS^m;DIJHAF^ 

SGISU?KKLQHVPGTQimDQSIVTnroWG 

KAKEfflL>IMFIQNLLWEKNVRNKDNH^ 

IRLKGLVSlKDKSQQVIVQGV«JSLXUi-i»irv 

SWKDDTERTNRLVLLGKNLDKDILKQLFIAT 

VTPTEKOWTTHFKEDQVCr 


649 


1999 


A 


4873 


226 


189 


DGVSLLLPKLGVQWAQYWAHWQPPLPbi^ 
FSCLSLRSSWD*KCAPPHPAFVFLVEMOFHRV 
GQAGLELRTSGDPPASASQSAGITGVSHLA*? 
r^\AV\ A PFORLCVYI 


650 


2000 


A 1 


"4874 " 


2 


"^i7 


-FFFimSFAFVAQAGVQWCULUSPQPLri'ur 
K*FSCLSLPSSWDYRHAPPPCPS*FLYF«*R<^ 
FTMLARLVLNS*PHDLPTSPSQSAEIKGySH^ 
CPASFYLFLKYYLEAKFCA'Ola-AraAOvw 
GYKRGHKSCLLINCWQI 


651 


2001 


A 


4898 


1701 


771 


DAWOPETRLARILNPDSKM'RPOBIJElJiAlK 

PHMEPKASCPAAAHAIBRKFHVLVGVTGSV 

AALKi-PLLVSKlJX>IPGLEVAVVTreRAiaffY 

SPODIPVTLYSDADEWEMWKSRSDPVLHIDL 

RRWADLIXVAPII)AimX3KVASGlCDTJlJUTC 

VMRA WDRSKPLLJrCPAMr>i i am w^^^iri j v 
VDQLKAFGYVEIPCVAKKLVCGDEGLGAMA 
EVGTIVDKVKEVLFQHSGFQQS*PGISVMGVP 
» ^/or!ii/\fr\AVQWiuinvflKlGGYPHLLNGGPA 
1 ST PROOACSRLNWTEOPOLSFFOPGEAAA 


652 


2002 


A 


4927 


1 


611 


- FRGRQTSRPARGFSPWRmilMQEPSSObcrA 
SP*LPCASNRLAFGOIJFPCAPLVPYPAPFSPLL 
PAFSCAPRPRAHIHSRIHPSAPLVPKPSSRAR 
rOSPIPSRASSPSCSWAQVPGVALARCAGVC 
KPGDSWRVAAaSQRCCSRGRRRGSGPRNPE 
ncTOnAWnPSFWGSWKSORELSAGGAQAWP 

T.I.GSAGSGLRGEA 


653 


2003 


A 


4965 


2 


T285 


- -pFFFPDGVSLCHPGWNAV ARSWLTAi^>A5R 
VOAVSCFRLPSSWDYRHATMPG*FF*YF**R 
wrritrrtT ATI vixiQ^pnvirpPWPPKVL'n^QA 


654 


2004 


A 


4968 


3 


437 


- RPGIPGRRFRRSWFCQLP*EPliP^A-t^A*^^,^ 
• TPAVGLGALGYIPPVRVPQRPPTQRSQGRGW 
DPERDPGCRVQVSRGPRFOEQKTPGLQGCLP 
PPCLTHlj\AASC:VVWCGRWKRDSAECQm 
HSCSAVSOOEDRCRSSSCS 


655 


"2005 


A 


4983 


201 


397 


- "MNNNTTCiQPSMISSMALPllViLLCl VUVtUN 
XT «nWTTn.TKIGKKTSTHIYLSHLVTAl^LVC 


656 
657 


2006 
2007 


A 
B 


4988 
1 5008 


332 
129 


159 
\aS5 


- "LVIM^MYRKFFEEEAQASNKHVTRULiSbVA 

piJVWKT^m*HFLPIRLEKNKNNIKD 

- iaAGjBcrASQDVlDSSWELit.V^^ 
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Amino acid sequence (A=Aianine v.=^y5«mi^ 
I>«Aspartic Acid, EKJlutamic Add, 
F=Phenylalanine, CKjlycine, H^Histidinc, 
Msoleucine, KrLysmc, L=Leucine, 
M^Methionine, N=Asparagme, P=Proline, 
Q=Glutaimne, R=Argiune, S=Serme, 
T=Threomne, V=VaHne, W'^Tiyptophan. 
y«=TyTOsine, X=Unknown, *«Stop codon, 
/=possiblc nucleotide deletion, V=possib!c 
nucleotide insertion 




nucleotide inse rtion 

VTXRVTGESmCK}VlI.KlVEgiNRKQDWSUW 
AIWWEQKRQWLUJTHWTLDKYGILADARLF 
T?r.P QHRPVlLRLPNRRALRLX* . 




KIU'SCl^SSWDYRCAl'rMFANrvri.yBtw 
V1 1 TT.«SANLGLSTSLP1FUILLS 
RGHGGKSLTGGTPGKWGUULLVSEDWSHLU- 
^v,o, -i/Qmn rtif WSPnLOGPGLSAVHTWPWL 



RGHGGKSLTGGTPGKWUUUIXV atu w onuir 
rNSLVSPVlXKWSPCLQGPGLSAVHTWW. 
iWcWAVHVKTHMRPGLAVLPRLVLNSWS 

♦AIILLWPPKALGLQA 




HPGHIX:LQPPGGPA»FUJYi;KUU^nrvrv«i 
AGSCPRQKKTTPGPTVLCVCSFWIYQROEPH 

HRTOARWNH 




LSLPNSRDHRHVPPIU-Alh^*Aliiuarir«v 
LELLGSSHPPTSASQSARTTGVSHRAWPLK* 

NLNOYQTLTMN ^ 

ELffe^MOTLAVTGSWAD^ro^ 
i^S(y}iaiALRTLLSQGYTm<AVTLm^ 
LffiAaGDHVACARmEAGANVNAITIDGV 



SPTHEGASKGHHECLDlLlSWOluvuyiiirnov 
TPLYVACMAQQFHCIWNLryAQAGVRKGKY 
W DTPLPGAOHQS TnKT.E*LFAMVHWQ 
-N^S»SFAHCASVmafflYMUOyilH-U.Vi6K. 
ADIJ>EGVAVSGPSPAEFCRKHRLPAPVPFSC^ 
GPAEPSTnFTQLATMAAFPHLVHABUffSSF 
P^., T ^.wr.A AVA AVLSFSLYRVLVKSQ 

QLlJVmXLYLFTUiTNAmSTmJJRAlJHW 
xWi An scsEICYTFVIVPKMLVDLLSQKK 



TISia.GCAIQMFSFLFFGSSH5FUJV>«««i^iV 
YMAICNPLRYSVLMGHGVCMGIAIAAAWAC 

fiPTVSl.VTTS LVFHLPFHSSNO HE 
OOYHNTGSAGHHAHCQVGHSKHVHKPSUUj 
-•-IQRGLPSFNSLEGHSLKDSCfflEESVQLDffi 
«nD<!i vrnTAVNDVLNTSVTSMGSQMPD 




KRTFATFOKDVSDHPAffiRtTUiUiu^' y i 
ICSPKVNSVIREAONOCBAISPVTSPLHLKSSL 

PTKPSVSY EIVPPQrrARRC „■ 

VCTKAOLSGLOLQPCLYKRR 

DLTNSHFFLFDFQKTGPPLOePKAQFSSLQW 

SS^wv(jlK■lAVKYF^-*K^^til>£N^^ 

_ . . . ^rA.iT T cTpn AfinPPVIjGHTV 



RALPTTFADIENLKYLLFTODASQPra^ 
TFGDLEYVTVEGGIVLSREIA1KRLNR1JJ5NSE 

TCADQSVIWKLSEDKQLAICXKYAOVH^ 
EDYEGRDVFNTKPIAQLIEEALSNNPQQWBG 
^r^n^A mTMP.! TPnKMEVMMYGLYRLRAF 



EDYEGRDVFN l wiAViuicurtw w 
CCSDMAITFNGLTPQKMEVMMYGLYRLRAF 

r^tJVTDjnTT .VCT.PPVGSEND 

■pGRPTRPPLLlLLAHVSPEPAOmiffiLAgi'O 

AfiSUcHPPi I rosocLSEPVPosHOPr 
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SEQID 
NO: of 
pq)tide 
seq- 
uence 



Met 
hod 



2021 



2022 



SEQ 


Predicted 


ID NO: 


beginning 


in 


nucleotide 


USSN 


location 


09/496 


correspondi 


914 


ng to first 




amino acid 




residue of 




peptide 




sequence 



Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 



2023 



2024 



5105 



5148 



672 



72 



5152 



5153 



2025 



210 



400 



314 



335 



2953 



5154 



599 



1880 



Amino acid sequence (A«Aianme OCystein^ 
D=»Aspartic Acid, EFGlutamic Acid. 
F=Phenylalanine, G=<}lycinc, H=Hisddmc. 
Hsoleucine. K=Lysine. LpLeucine, 
M=Methionine, N=Asparaginc, P=Proline, 
Q=Glutamine, R=Arginine, S«Serinc, 
T=Thrcomne, V=Valine, W=Tiyptophan, 
Y«Tyrosine, X»Unknown, *'==Stop codon. 



nucleoti de insertion „ 

DEL*NMNGRVDYLVTEEE1NLTRGPS(jLU1'N1 

VGGTDQQYVSNDSGIYVSRIKENGAAALDGR 

LOEGDKE.SVNGQDLKNLLHQDAVDLFRNA 

GYAVSLRVQHRLQVQNOPIGHRGEGDPSGIPI 

PMVT .VPVFALT MVAAWAFMRYRQQL 

RDGREELCWQEPTLPSRlCSSAPlXm^^ 

VLLLLLUSlLCLYWKARKimRSiraiKEKA 

TWVn T.KEAGGVTTNRMED*EEDECN 

IIYFSYNIFLKrrELLNDVKRLKQALNCjLSQLT 

YTSGNFTKRQSQLIDTLQHQVKSLEQQLAVS 



NHFVEVT 

LTEDQPFDILQKSLQEANliEQTLAEEAYLUA 




OTLOPIGVmVFVCjASt^A:iJNi vu vynw* ivav^ 
VGISVPSQHLSNSSQISGSGQIQLIGSFGNHPS 
, « ^^TvirWcnTn vnsnmAPSNVSGGIXV 




GQQNTYNVNNLOig^^ttnv\^iAuorrtoi^u|*^ 
GSWGPHMSVNIVNQQNTOCPVTSQ^^^^ 
GSIVIHSPMGQPHAPQSQFLIPTSLSVSSNSVH 
HVOTINGQLLQTQPSQUSGQVASEHVMLNR 
NSa^RTNQPYTGPl^NNQKrAVHLySGQ 
. . or« rx A xTu A cpHT vnnoMPLOOASPT 




VLHLSPGQSSVSQiJKrorAiivirov loin^vj* 
FPAVSSACTAHPSLGSAVQSGSSGSNFTOTQL 
r^r^xm^T^\T<imi WSSRKSTSTFSNTPGT 




OEKWGSSPGHrAVVi vcDnouuv^ x^^^x 
LTKGAFILQQLQRDQAHTVTPDKSHFRSLSD 

AVQRLLSYHVC<»SMPTEEDUUCVD1^FCT^ 
ATQLLKRTQAMLNKYRCLLLEDAMRI>^PAE 

MVMIDRMn^QEE^ 
ADFCCSFKLDKAAHETQFGRSDQHGSKASSS 

LOPPAKAQGRDRAKTGVTEPMNHDQFHLVP 

iJWsABGNISKmCLGRAUOT^^^ 
YOSTSEEKASRREPLKASQCSPGPEGHRKT^ 
«oktjnTc«vT 5Rn .AnSHLEMTCNNSFQDKSL 




NSILEG — — — ~— 

LKKMEPFSCUTFVALFPArVD NRIIFGKNI^UK 

LYDEVQEVVYFPAVVHDNLGERLKCTYIEID 

OVPCTYAWLSRPAWLWGAEMGANEHGVCI 

ONEAVWGREEVCDEEALLGMDLVRLGLERA 

otabkaliwwllekygqggnctegrmvf 

SY^OTJADRNEAWILCT^^ 

GxKw^rmiAREHPD^ 

WDGKKEFDFAAAYSYLDTAKMMTSSGRYCE 

GYKLLNKHKGNITFETMMEILRDKPSGINME 

GEFLTTASMVFILPQDSSLPCIHmGTPOTER 



IfflOEIDRRHPLYQKHWAJ-llV VINlNlNCIilvruNa 

MLDNMRKIJEKH-FREMESILQNKHLDVEKIV 
NLFPQCTKDEIOIYQSNLSVKV SS 



224 



wo 01/57188 



PCT/USOl/03800 



SEQID b 
NO: of 1^ 
nuci- P 
eotide s 
seq- u 
uence 


EQID N 
0:of h 
eptide 
sq- 
ence 


[et S 

od n 

ii 
I 
0 
9 


EQ P 
DNO: b 
I n 
fSSN I 
9/496 c 

14 E 
8 
I 

I 

j 


redicted ^ 
cginning n 

UClCULlUw ■ 

^cation c 
orrespondi t 
ig to first i 
imino acid ( 
esiduc of i 
jeptidc 
jequence 


Predicted end / 
ucleotide I 
:>cation ' 
orresponding 1 
0 last amino 1 
Lcid residue * 
)f peptide 
;equence 

> 


onino acia sequence (A=Aianmc l^Cysteine, 
>=Aspartic Acid, E=Glutamic Acid, 
'^Phenylalanine, 0=Glycine. H^Histidine, 
=Isoleucine, K=Lysine, L=Leucine, 
vl=Methiomne, N»Asparagine, P=Proline, 
5=Glutamine, R=Arg^nine, S=Serine, 
r=Threonine, V=Valine, W=Tryptophan, 
/^Tyrosine. X^Unknown, *=Stop codon, 
f=possible nucleotide deletion, V^ssiblc 


676 : 


i026 J 




5155 : 


I 


306 


FFFLRRSLALSPRPDCGLQWRNLGSLQAPPTO 
FTPFSCLSLPSSWDYRRPPPRPANFLYF**RRG 
FTLLARMVSIS*PHDPPASASQSAGrrGVSHRA 

FFHSVDLLALHQSKTFYKPDWFDIVESEVKCiJ 
KEAVCVIDMSSFTEFEITSTGDQALEVLQYLF 
Sl^LDWVGHmrrGMLKEGGGYENDCSIAR 
LlsnCRSFFMISPTDQQVHCWAWLKKHs^m^ 

^rt r r r>T\\rmTWTAl MT TfrPRAVDVLSELSYA 

PMTPDHFPSLFCKEMSVGYANGIRVMSMTHT 
r,RPr,FMLYIPlEYRWGFTMLSTLVSNS 


677 


2027 


K 


5167 


97 


740 


678 '"■ 


2Q28 


A 


5183 


1919 


2018 


PALCRLRDDMTVCVADFGLSKKIYSGUYYKVJ 
GRIAKMPVKWlAIESLADRVYTSKSDWAro 

VrMWEIATRGMTriru Y Vi^"*^^ * *^ * 
MRI KOPEDCLDELCKI**SPQSP 


[679 






5190 


39 


499 


RESQVKHFK^^ 

KHPPENIIDGNrETr W 1 1 luivirrviiriiv^A * 
VRIERLVIQSYFVQILKIEKSTSKEPVDFEQW 
EKDLVHTEGQLQNEEIVAHDGSATyLRFnVS 
AFDHFASyHS VbAbU 1 w jda^ j j 


680 


2030 


A 


5204 


541 


92 


-mAViXLACGDlSLNAl^MVAiym^^ 
LLICLSYLFILSAlLRVPSAAGRCKAFSnS^ 
RTVVWFYGTISFMYFKPKAKDPNVDKTVAL 

rj^ X « rrr-nOJ XTOTTVCl "DTVl APVK AAVLTLLRGG 

1 T RRK ASHCYCCPLPLSAGIG 


1 681 




A 


5207 


10 


247 


VPDNGDVTKLPVCSTLVBHl'SLTVSEAMbyM 
T^n-ocT TtnTx AUTrMT<;TLGGRGRWIT*GREF 
r^-reMANMVKPCLYRK 

- T^T^T^n-rco vcrrn A f;vnWPNLSSLKTLPFGFK*F 


682 


2032 


A 


5210 


2 


231 


SCLSLPSSWDYRCLPPCPANFOFSRNGVLPC 
WPGWSRTPDLS 


683 


2033 


A 


5218 


85 


402 


CPSVSGLlKSDLRRHNINlGlTNVDVKAVSiNir 
MIILLRSMYRIN\^YFH*LFPSR^ 
vAnovTTTT TP*T v\ *TPADSPTDOEPKTVMLSK 

nsp<?Al 


684 


2034 


A 


5220 


1 


194 


■ ^^.MKEMQN]^SE^On^TWEEYKJUIK^^^^ 
YG»ALNVIKMAVLmAIVRFSATLVKIPQHL 

TDS 

- -V tTQnnrM<gnPRyPCXjEMSAHAFPVQTC;GfcmJ 


685 


" 2035 


A 


5228 


260 


440 


n«-trTPQVPINFTELSKCS*S*KIMSGERE 


686 


" 2036 


A 


5239 


" 79 


" 508 


- GGEAAAIUVAKI^SPia'HRVOkKKKOVOUWi 
AFSEAALEKKimSNSQQSVQTI^^WHHR 
KMSRPIVTVTVEREUUtAKHWlXmYLAJ© 
VIQNSKRKGPEFTKDFAPVIVEAFKHVSSETD 
PSrKKHLGRVLSIWEERS 


687 


2037 


A 


5244' 


1 


428 


- ^MAWAATALKGRGAW^IARV1J^0IL^A1A 
NKASHNRTOALQSHSSPEGKEEPEaSPEIJETa 
PRKRGKNPMKAVGLAWAIGFPCGILLFILTKR 
t...».rTvn'irv/-tx>fvAi>rk>j\/niI wrOKYESORFR 
ARSf)RAPSPDVGSGVOT 


688 


2038 


A 


5249 


1 


1407 


- LQCyim)KSLLNQGSSSEEVAUSSgU«^yHrf 
SGDSDLATALHRLSLRRQNYLSEKQFF>^ 
QRKIQVLADQKEGVSGCVTPTESI^LCTTQS 
EfTOLSSASCLRGFMPEKlXJIVKPI^QTLy 
HWQQLAQPNLGTILDPRPGVnKGFTCM^ 
AIVMSDLEEDEEEGnTQVQQELEVEEKLSre 
KPVTGDFLPPITSAGGPVTVATANPQKCLSCr 
NSTFTFTTCRILHPSDrrQVTI>SSGFPa.SO^ 
GSSSSNTAVNSPALAYRLSIGESrrNRRDSnT 
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SEQID 5 
NO: of 1 
nucl- ( 

seq- ' 
uence 


JEQID I 
^0: of \ 
)eptide 

aence 


lod I 

i 
\ 


SEQ I 
DNO: \ 
n I 
JSSN 1 
09/496 
914 


Predicted 1 
)eginnmg i 
lucleotide 1 
ocation < 
:orrespondi 
ng to first 
amino acid 
residue of 
peptide 
secfuence 


Predicted end i 
lucleotide 
ocation 
MJiresponding 
to last amino 
acid residue 
ofp€?Jtide 
sequence 


^^d^io add sequence (A=Aiamne G=Cystcme, 
D=Aspartic Acid, E=Glutamic Acid, 
^^Phenylalanine, G«Glycine, H^Histidinc, 
Nsoleucine, K«Lysinc, L«Uucine, 

W=Methl0ninC, N— ASparaglUC, r riuuu«o 

Q=Glutamine. R=Aiginine, S=Serine, 
T^Threonine, V«Valine, W=Tryptophan, 
Y^Tyrosine, X=Unknown, ♦^Stop codon, 
/=possible nucleotide deletion, \ppossiblc 
nucleotide insertion 














FSSTMSLAKLLQERGISAKVYHSPISENPLQFL 
PKSLAIPSTPPNSPSHSPCPSPLPFEPRVHLSEN 
FLASRPAETFLQEMYGLRPSRNPPDVGQLKM 
NLVDRLKRLGIARWKNPGAQENGRCQEAEI 
GPQKPDSAVYLNSGSSLLGGLRRNQSLPVIM 
nSFAAPVCTSSPKMGVLKED 


689 


2039 


A 


5254 


2 


2621 


LSLFGSRALGRSGARAMAKAKKVGARRKAS 

GAPAGARGGPAKANSNPFEVKVNRQKFQILG 

RKTRHDVGLPGVSRARALRKRTQTLLKEYKE 

PJDKSNVFRDKRFGEYNSNMSPEEKMMKRFA 

LEOQRHHEKKSIYNLNEDEELTHYGQSLADIE 

KHNDIVDSDSDAEDRGTLSGELTAAHFGGGG 

GLLHKKTQQEGEEREKPKSRKELIEELIAKSK 

OEKRERQAQREDALELTEKJLDQDWKEIQTLL 

SHKTPKSENRDKJKEKPKPDAYDMMVRELOP 

EMKAQPSNRMKTEAELAKEEQBHLRKLEAE 

RLRRMLGKDEDENVKKPKHMSADDLNDGFV 

LDKDDRRLLSYKDGKMNVEEDVQEEQSKEA 

SDPESNEEEGDSSGGEDTEESDSPDSHLDLES 

NVESEEENEiCPAKEQRQTPGKGUSGKERAG 

KATRDELPYTFAAPESYEELRSLLLGRSMEEQ 

LLWERIQKCNHPSLAEGNKAKLEKLFGFLLE 

YVGDLATDDPPDLTVIDKLWHLYHLCQMFP 

ESASDAlKFVLRDAMHEMEEMIETItGRAALP 

GLDVLIYLKITGLLFPTSDFWHPWTPALVCL 

SOLLTKCPaSLQDWKGLFVCCLFLEYVALS 

QRFIPELINFLLGILYIATPNKASQGSTLVHPFR 

ALGKNSELLWSAREDVATWQQSSLSLRWA 

SRLRAPTSTEANHIRLSCLAVGLALLKKCVLivi 

YGSLPSFHAIMGPLRAaTDHLADCSHPQELQ 

ELCQSTLTEMESQKQLCRPLTCEKSKPVPLO- 

FTPRLVKVLEFGRKQGSSKEEQERKRLIHKHK 

REFK.GAVREIRKDNQFLARMQLSE1MERDAE 

BTITTIKVKOLFNSLATQEGEWKALKRKKFKK 


690 


2040 


A 


5261 


1 


304 


FFFFVFLVETGFHHVGQAGLELLTSGDPPl W 

ASQSAOITGVSHCSWPVIYVLSTLLHAVRNVL 

FKRTFPLKSSSFLSYDKEIFPEiVLKPYLVTLT 

SFVK. 


691 


2041 


A 


5270 


3 


158 


■ IjCmTHCTANVA^PG'rPPGWKIDGPAAAL 
KVLSSFFFFFLKFSYKPQNIV 


692 




A 
A. 


5282 


56 


1268 


" GMEPVGCCXiECRGSSVDPRSTFVLSNLAliVV/ 
ERVLTFLPAKAlIilVACVCRLWRbCVKKVi^tt 
THRSVTWISAGLAEAGHLEGHaVRWAEEL 
ENVRILPHTVLYMAUob ittloLctaK^iSAjnri^u^ 
KRTSMETALALEKLFPKQCQVLGIVTPGIWT 
PMGSQSNRPQEEIGESGFALLFPQIEGIKIQPF 
HFIKDPKNLTLERHQLTEVGLLDNPELRWLV 
FGYNCCKVGASNYLQQWSTFSDMNIBLAGG 
OVDNLSSLTSEKNPLDIDASGWGLSFSGHRI 
OSATVLLNEDVSDEKTAEAAMQRLKAANIPE 
HNTIGFMFACVGRGFQYYRAKGNVEADAFR 
KFFPSVPU^GFFQNGmGCDRIVTGNFILRKCN 
PYKOPPT W1SYTTTMALIHLGSSK 


693 


2043 


A 


5301 


" 362 


507 


" EEIKERFOPGLNrtYWYGFlQl^CNRERUiLLr^ 
ArFPTNIVTLCHSIA 


694 
695 


2044 
2045 


A 
A 


5310 
3315 


1 

12S 


204 
1596 


RVtTAINHTLKENLRKFYKOKKDKFLDLKi'K 
KTRAMRRRLNMHEENLKTKKQHRKERLYPL 

RKYAAKA 

" ErRSTAVKSfeVQVCISaUXH:)RWKKAiSJ' 
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SEQID 
NO: of • 
nucl- 
eotide 
seq- 
uence 


SEQID j 1 
NO:of 1 
peptide 
seq- 
uence 


Viet 

tied ] 


SEQ ] 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

coirespondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (AeAlanine t>=cystcm^ 
I>Aspartic Acid, EpQlutamic Add, 
F=Phcnylalanine, G=Glycinc, H«Histidine, 
Msoleucine, K=Lysine, L=Leucine, 
M=^ethioninc, N=Asparagine, P=Proliiie, 
Q=Glutanune, R=Aiiginine, S=Scrine, 
T=Threonine, V=Valine, W=Tiyptophan, 
Y=Tyrosme, X^Unknown, ♦-Stop codon, 
/^possible nucleotide deletion, HH)Ssible 
nucleotide insertion 














TGSGKEEGPAPCKQMKLKAAGGPSALNFDSF 

SSLFESLISPIKTETFPKEFWEQKPLLIQRDDPA 

LATYYGSLFKLTDLKSLCSRGMYYGRDVNV 

CRCVNGKKKVL^UsJJuJ^Arl^l- V la^ivtv. 

ATIQFHQPQRFKDELWRIQEKLECYFGSLVGS 

>rVYITPAGSQGLPPHYDDVEVFILQLEGEKH 

WRLYHPTVPLAREYSVEAEERIGRPVHEFML 

KPGDLLYFPRGTIHQADTPAGLAHSTHVnST 

YQNNSWGDFLLDTISGLVFDTAKEDVELRTG 

IPRQLLLQVESTTVATRRLSGFLRTLADRLEG 

TKELLSSDMKKDFIMHRLPPYSAGDGAELSTP 

GGKLPRLDS WRLQFKDHIVLl V LrU vjui^au 

ETQEKMVYIYHSLKNSRETHMMGNEEETEFH 

GLRFPLSHLDALKQIWNSPAISVKDLKLTTDE 

EKESLVLSLWTECLIQW 


696 


2046 


A 


5318 


1476 


742 


LMKXYLEAAELGEISDffirrKLLRLSSSQGTlb-l 

SLQDIDSRLSPGGSLADAWAHQEGTHPKDRN 

VEKLQVLLNCMTEIYYQFKKDKAERRLAYN 

EEQIHKFDKQKLYYHATKANnOTTDECVKK 

YEAn^NKSEEWIRKMLHLRKQLl-oLrl iMi^r ui 

EEEVSKYQEYTNELQETLPQKMFTASSOIKHr 

hOTrVTSSNTLVEMTLGMKKLKEEMEGVVKE 

T.AKNNHILESGGSLTMDGGLRNVDCL 


oyi 


2047 


A 


5320 


244 


478 


LDYNFFLFEMTFGLVSQAGVQWHDLGSLQFf 
PPGFKQFSCLSLFSS WD I KjiJL*rrrii-/uN r olsji vj 
VSPSWPGWSRTPDFR 


698 


2048 


A 


5324 


"266"^ 


714 


"LPIRKSLRSVRSGFPTSQSPrrRNLDGTASUSU 
LAKTVTGSLFRINVGLRGLVAGGUGALLGTT 
VGGLLMAFQKYSGETVQERKQKDRKALHEL 
KLEEWKGRLQVTEHLPEKIESSLQEDEPENDA 
KKIEALLNLPRNPSVIDKQDKD 


699 


2049 


A 


5334 


699 


277 


RPHGHLVCISSSAULSU V NUL.au I W\OJvrrua 

FGFAESVFVETFVQKQKOKTnVCPFFIKTGM 
FEGCTTGCPSLLPILEPKYAVEKIVEAILQEECM 
YLYMPKLLYFMMFLKSFLPLKTGLLIADYLGI 
T.HAMDGFADOKK 


700 


2050 


A 


5344 


3 


614 


^ITAEEMSSLTPESSPELAKRSWFGNFISLDKIiJti 
QIFLVLKDKPLSSIKADIVHAFLSIPSLSHSVLS 
QTSFRAEYKASGGPSVFQKPVRFQVDISSSEG 
PEPSPRRDGSGGGGIYSVTFTLISGPSRRFKRV 
VEnQAQLLSTHDQPSVQALADEKNGAQTRP 
AGAPPRSLQPPPGRPDPELSSSPRRGPPKDKK 

LLATNGTPL 














701 

702 


2051 

2052 


A 
1 ^ 


5346 
5356 


3 

2502 


1383 
"T540 


" "HAS VLFCRVMAASKTQGAVARMQEDRIXiSL 
STVGGVGYGDSIODCIIJEPLSLPESPGGTITLE 
GSPSWCIFCEEHFPVAEQDKLLKHMnEHKIV 
lADVKLVADFQRYBLYWRKRFTEQPrrDFCSV 
nUNSTAPFEEQENYFLLCDVLPEDRDLREELQ 
KQRLREILEQQQQERMmOTIGVCMFCNEEF 
LGNRSmNHMAREHAFNIGLPDNIVNCNEFL 
CTXQKKLDNUJCXYCmTFRDKKnJOJHm 
KKQHRKINPKNREYDRFYVINYLELGKSWEE 
VQLEDDRELLDHQEDDWSDWEEHPASAVCL 
FCEKQAEnEKLYVHMEDAHEFDLLKIKSELG 
LNFYQQVKLVNFIRKQVHQCRCYOCHVKFKS 
KADLRTHMEETKHTSLLPDRKTWDQLEYYFP 
TYENDTLLWTLSDSESDLTAQEQNENVPnSE 
DTSKLYALKOSSILNQLLL 

- ■ MAAATRGCRPWGSUXJLU}LVS^^ 



227 



wo 01/57188 



PCTAJSOl/03800 



SEQID S 
NO: of ^ 
nucl- P 
eotide s 
seq- ^ 
uence 


EQID h 
10: of h 
eptide 
eq- 
lence 


let S 
od I 
i 
I 
( 
c 


EQ 1 
DNO: b 
a n 
JSSN I 
)9M96 c 
)14 1 
J 

: 


redicted I 
«gtnning r 
tucleotide 1 
ocation ( 
»nespondi 1 
ig to first 
unino acid 
•esidue of 
peptide 

sequence ^ 


'redicted end V 
tucleotide ^ 
ocation ^ 
:orresponding 
to last amino 
icid residue 
of peptide 
sequence 


^ino acid sequence (A^Aiamne OCystcme» 
>=Aspartic Acid, &<jlutamic Acid, 
?=Phenylalanine, GKjlycine, H=Histidine, 
=lsoleucine. K^Lysine, li=Uucine, 
M=Methionine, N-Aspaiagine, P=Pn)line, 
[3=Gliitamine, R^Aiginine, S=Serine, 
r=Threonine, V^Valine, W«Tiyptophan, 
Y^Tyiosine, X^Unknown, ♦^Stop codon, 
/=possiDie nucieoucic aeicuuii* v-|w»'i'**' i 

nucleotide insertion J 

LASLRCTLOAt'CECDFRPDLPGLECDLAQHL 1 














AGQHLAKALVVKAIXAFVRDPAPTKPLVLSL 
HGWTGTGKSYVSSLLAHYLFQGGLRSPRVH 

HFSP VLHFPHPSHIER I l^Kl'l-JS^ w v v^uni^ 1 1\ 

CGRSLFUTDEMDKMPPGLMEVLRPFLGSSWV 

VYGTNyRKAFIFlSNTGGEQINQVALEAWRS 

RRDREEILLQELEPVISRAVLDNPHHGFSNSGI 

MEERLLDAWPFLPLQRHHVRHCVLNELAQL 

GLEPRDEWQAVLDSTTFFPEDEQLFSSNGCK 

TVASRIAFFL J 


703 


2053 


A 


5380 


278 


657 


LFLQKlJyvlKTEEEARTm'KlEMFLRKEQi^Kl. 1 
EERLEFWMEKYDKDTEMKQNELNALKATKA 
SDLAHLQDU^KMIREYEQVIIEDRIEKERSKK 
KVKQDLLELKSVIKWAWWRGTMIRRBIGGF 


704 


2054 


A 


5381 


1 


1003 


FRGRAVKMAAWEVEVGGGAAGERELDbV 

DMSDLSPEEQWRVEHARMHAKHRGHEAMH 

AEMVLILIATLVVAQIXLVQWKQRHPRSm 

MNTTIJQMWVVPLYFTVKLHWWRFLVIWILF 

SAVTAmTRATRKPLVQTFPRLVYKWFLL^ 

KISYATGIVGYMAVMFTLFGLNLLFKIKPEDA 

MDFGISLLFYGLYYGVLERDFAEMCADYMA 

STIGFYSES(M»TKHLSDSVCAVCGQQIFVDV 

SEEGIIENTYRLSCNHWHEFCIRGWaVGiaC 

OTCPYCKEKVDLKRMFSNPWERPHYMYuv^i- 

1 nWT.RYLVAWOPVnGWQGINYlLGLE 

lYDRDPLQLATRAGQPLDlNMAGEPKI'YKPKF j 


705 


2055 


A 

A 


5395 


3 


675 

• 


GNKRPLSALYRLESKEPFLSVGGYVJ?U Y uxt 
RDDFYNRLFDYHGRVPPPPRAVIPLKRPRVA 
VTITRRGKGVFSMKGGSRSTASGSTGSKLKS 

DELOTIKKELTQIKTKiDbVlAtKi-ujsjiaR.w^ 

AEAEAQKKLLEESLVLIQEECVSEIADHSTEEP 

AEGGPDADGEEMTDGIEEAFDEDGGHELFLQ 

TV 1 

GRVGLNLEGRGCSBPKWKHU I r i Wi^ioyi-'t»A 
<5 1 


^706 


'"2056 


""A 


" 5410 


"I 




707 


2057 


A 


5415 


6 


287 


' l^FKLTPSFLSlCU^SSGQERKVFIELNHIKKCNl 
VRGVFVLEEFGNYTILLLGLDSHGSNSNLGAP 
PPHT fiAGRKRTSVEKSGGAGVTWCKRDP 


n/\9 
70o 




A 


5423 


" 3 


291 


' SSSNPLGSPSTLWKLCSFVLHNKSCCCi>WU5 

nwr^ n A m T\7D vmPTPPVS5KTTNPLGRTNNS 
XPTLRAITLTYKVUuririivo^i Ai^*-*-^*^**^' i 

nrnFKTVTLTARSTASLLKSVRPRTHQKE 


709 


2059 


A 


5424 


679 


347 


- RjRHEEKRGSRGRGRRTSEELXlPKlflKKHRUU 
SEFTDmSVHPSDVLDMPVDPNEPTYCLCHQ 
^rcvonk/nnrnxrPnrPTRWFHFACVDLTTKPK 

ftifWFrPRCVOEKjRKKK 


710 


2060 


A 


5442 


1073 


559 


- QESLKKKlQPKLSLTLSSSVSRONVSTPPRt^S 
GSLTPPVTPPITPSSSFRSSTPTGSEYDEE^pY 

_„„,«~T»-.ri A TOOT" A 11 OC\£f^/[MtZaW.lC \ 

EESDSDESWTIESAISSEAILSSMCMNtiuiiiiR. 
PFACPVPGCKKRYKNVNGKYHAKNGHRTQI 
RVRKPFKCRCGKSYKTAQGLRHHTINFHPPV 

SAFTTRKMOO 


711 


2061 


A 


5449 


1 


319 


- GDSLCVPQYNKYREERVlLFLKMASGHAi'wr 
DLVKRIRDAIRMGLSARHVPSLILETKGIPYTL 
NGKKVEVAVKQIIAGKAVEQGGAFSNPETLD 

l.YRDIPELOGF 


712 


2062 


A 


5499 


91 


749 • 


" RPTPGHGDFWMQPLTlU^AUMSbSSVTLASAL 
OVRGEALSEEEIWSLLFLAAEQLLEDLRNDSS 
DYWCPWSALLSAAGSLSFOGRVSHIEAAPF 
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"SEQID" 
NO; of 
peptide 
seq- 
uence 



Met 
hod 



SEQ 
ID NO: 
in 

USSN 
09/496 
914 



2063 



2064 



2065 



5506 



Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 



' >i«iicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
ofp^tide 
sequence 



5514 



5526 



5529 



5531 



5586 



25 



458 



311 



220 



460 



88 



"720 T^OTO A 



"72i r207r 



"IS I 2072 



5598 1 



5628 



5632 



5638 



330 



798 



146 



148 



Amino acid sequence (A-Aianme C-Cyslemcs, 
I>=Aspartic Acid, B=<jlutamic Acid, 
F=Phenylalanine, G=Glycine. H=Histidine, 
Msoleucine. K«Lysine, L^Uucine, 
M=Methioninc, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, V^Valine, W^Tryptophan, 
Y=Tyrosinc. X=Unknown, *»Stop codon, 
/possible nucleotide deletion, Nppossiblc 
nucleotide insertion 



3806 



KAPELLQGQSEDEQPDASg^ai^VYSLGMlLY 
WSAGFHVPPHQPLQLCEPLHSILLTMCEDQPH 
RRCTLQSVLEACRVHEKEVSVYPAPAOLHIR 
vrTLVLGTISEVSREPCFSSSSCWSCVAIKI 



'VEELDuVSRLUFHLHTPMYFFLAHLSbUJLShT 
TSSIPQLLYNLNGCDKTISYMGCAIQLFLFLGL 
GGVECLLLAVMATORCVAIOCPLHYMVIMN 
PiaCRGLVSVTWGCGVANSLAMSPVTLRLPR 
rnHHKVDHFLCEMPALIRMACISTV 



ASPYWCENNiiGiG^^ 
RLTSSVSCALDEAAAALTRMRAESTANAGQS 

DK 



KVTAPRRPQRYSSGHGSUNSSVLSGELFt'AJVl 



GRTALFHHSGGSSGYESLRRDSEATGSASSAP 
DSMSESGAASPGARTTISLKSPKKRATGLQRR 
RLIPAPLPDTTALGRKPSLPGQWVDLPPPLAG 
SLKEPFEIKVYEIDDVERLQRPRPTPREAPTQG 
LACVSTRLRLAEiaiQQRLREVQAKHKID£EE 
LAETQORLMLEPGRWLEQFEVDPELEPES/^ 
YLAALERATAALEQCVNLCKAHVMMVTCFD 

ISVAASAAIPGPQEVDV 



SPGYGENiai'VlSXNIAVPLCHMNKJ^S^ ^SU 
SSSSERTMDLVl^MCNTNSIHWCGISGRQLG 
KLHPSSSLCXALTLLSSVQGLQSISGLRLTDTF 

LKRTYE YDPIAQVCV ' 

NSEDU-KYFNFESWQEDLDNMVLUIPR^W^ 

RSYHDRKSKWLDRLNDDAKRYSCTPRhnrS 

VNIREELKLANVVFFPRCLLVQRCGGNCGCG 

TVNWRSCTCNSGKTVKKYHEVLQFEPGHIKR 

pf;p ^^.TK^AT vnTOT.nHHERCDCICSSRPPR 

AVLKNMAPMTALGLLDLHILNLILFLSAGEDF 

TSWSElMMmLVFLTLWLLlEMnrCYRKVS 

KAEEAAQENA 



KNCANEAVVQKILDRVLSKVDVRLRPNhU5M 
LATNSTRGLNEDELMAHGQEKDSSSESE0SC 
PPSPGCSFTEGFSFDLLNPDYVPKVDKWSRFL 

FPLAFGL FNIVAAERC TprKTrf^^ 

LPPAQIPEAWLLLANVVVVULVFLUJKLJIJV 



LLLRCKLLPSALQKMALGMFFGFTSVIVAGV 

WOIPQYLLIGISEIFASIPGLEFAYSEAPRSMQG 
AnvlGffFCUGVGSlXGSSLVALLSIKiGWLH 

cpkdfgnjnncrmdlyffllagiqavtallf 

vu/iArxUVK RASOGPASHSRFSR DRG 
MSALIVRKLRSAELTLFSHLFlVJLUANVls^ 



KLHETALHHAAKVKNVDLIEMLIEFGG^ 
RDNRGKKPSDYTWSSSAPAKCFEYYTHOT 
LSQLCRVNLRKATGVRGLEKIAKLNIPPRLID 

YLSYN 



cpsldirsevaelrOTc^^ 

TATGEDFRGLSFPRLTQVTOYLLI^WGUS 

LRDLFPNl^mGTRLFLGYALW^l^ 

VALPALGAVLRGAVRVEKNQELCHLSTIDW 

GLLQPAPGANHIVGNKLGEECADVCPGVLGA 

AOTPCAKTTFSGHTDYRaVTSSHCQR^^^^ 

HGMACTARGECCm^CLGGCSQPEDPR^^ 

ACRHLYFQGACLWACPPGTYQYESWRCVTA 

ERCASLHSVPGRASTFGIHQGSCLAQCPSGFT 

PT.Tc.cpcm7rnnrfT:nTi-;PKECKVGmTIDSlQAA 
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NO: of 
nucl- 
eotide 
seq- 
uence 



SEQID 
NO: of 
peptide 
seq- 
uence 



Met 
hod 



rs6Q 

ID NO: 
in 

I USSN 
09/496 
I 914 



I Predicted 
j beginning 

nucleotide 
I location 
I conpespondi 

ng to first 

amino acid 
I residue of 
I peptide 

sequence 



Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 



723 



724 



rioTr 



5672 



1704 4235 



216 



940 



Imlno acid sequence (A=Alamne CM^ysLeme. 
D-Aspartic Acid, E=Glutamic Acid, 
F=PhenylaIanine, CKilycine, H=HisUdme, 
I='Isoleucine,K=Lysine.L^Leacine, 
M=Methionine, N=Aspaiaginc, P^Piolme, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tiyptophan, 
Y«=Tyrosine, X^Unknown, ♦"Stop codon, 
/^possible nucleotide deletion, \=TX>ssible 
nucleotide insertion 




AMVDGNYTLYVLDNQNLQQLGSWVAAGini 
PVGKIYFAFNPRJLCLEHIYRLEEVTOTOGRQN 
KAEINPRWGDRAACQTRTLRFVSNJT^^ 
LLRWERYEPLEARDLLSnVYYKESPFQNATE 
HVGPDACGTQSWNIXDVELPLSRTQEPGVTL 
ASLKPWTQYAVFVRAITLTTCEDSPHQGAQS. 
PIVYLRTLPAAPTVPQDVISTSNSSSHLLVRW 
KPPTQRNGNLTYYLVLWQRLAEDGDLYIU^ 
YCHRGLRLITSNNDPRFIXSEDGDPEAElyffiS^ 




RLGGNSSDFEIQEDKWRERAVLSGLRHFTEY 

ADOn-OKVAWEASSKNSVLLRWI^ProroOL 

n^KYEKYRBLQEEATVLCVSKLRYAKFOGV 

HLALLPPGNYSARVRATSLACNOSWTOSVAF 

mOPEEEDAGGLHVIXTATPVGLTUm^ 

LCTFYGKKRNRTLYASVNPEYFSASDMYVPD 

^«nrvroTTnf?T AsnnQTjnMWP.fiLARGLE 



AGEESTPVALNTVNliLAoriuiVij-r , 
KAFKCHHWRLLGWSQGQPTLVIMELMTR 

_ . oxmr^xn vPVKmnfiVLEELEGC 





s»^sSg;^s^^^^ 

TPRDCSPQNGOTGH =?^75?r 

EEPQFPPPSVKIPMESVQSDPQNGIHCIARKR 
SSSWSYSL ^— 

argrrsrpvwaaswggrgrpaarrkpruia 

ATMGI^KFIXJDVDFDLKCALCHKVLEDP 
LTTTCGHVFCAGCVLPWVVQEGSCPARraGR 

>A^(X)LPEHLERCDFAPARCRHAGCGQVLL 
RRDVEAHMRDACDARPVGRCQEGCOLPLTH , 
GEQRAGGHCCARALRAHNGALQARLGMJK 
Al^KEAUUGKREKSLVAQl^Ql^LQMT 
7r!^i;-r^.-™vcADi nsT cnrVAAPPGGKGE 



inXSLTLVUiRDSUslAjr NuuvjivTij » 

SSSEOIFVSKIVDSGPAAKEGOLQIHDRIIEWI 

llSwDQAN^TAKEPIWQ^nLm 

PRTKMFrPPSESQLVDTGTQTDnFEHIMALT 

1^^ISPVLDPYLLPEEHPSAHEYYDPNDY1 

^rl^f^rTii,, ni CTjvni VBMNSODKLGLT 




A T^/vrrk A n\n r.TPVnF.rRRFRELLELK 




ESWMLHNSGFRNYmsiuviavn^j^i^^ 
KSDKDSSSAYNT ^-^^f^STPLTLElSPPKSLRR 



230 



PCTAJSOl/03800 

WO 01/57188 



^EQID SI 
NO: of N 
nucl- P< 
eotidc sc 
seq- u 
uence 


iQID 1 M 
0: of h( 
5ptidc 
jq- 
ence 


[et SI 
3d E 
in 
V 
0 
9 


SQ P 
)N0: b 
n 

SSN U 
9/496 0 
14 n 
a 
r 

F 

s 


redicted P 
sginning n 
acleotide k 
>cation c 
orrespondi t 
gtofirst a 
mino acid c 
esidue of s 
>eptide 
equence 


redicted end A 
ucleotide C 
)cation F 
orresponding 1' 
3 last amino ^ 
cid residue C 
»f peptide 
equence 

/ 
I 


55b acid sequence (A«Alanme OCVsteine, 
►=Aspartic Acid, EpGlutamic Acid, 
^Phenylalanine, <H}lycinc, H«HiStidme, 
=Isoleucine,K=Lysine,LFLeucine, 
4=Methionine, N=Asparagine, P=Prolme, 
MSlutamlne, R=Arginine, S=Serine, 
'=Thieonine, V-Valine, W=Tiyptophan, 
f«Tyrosine, X^XJnknown, *='Stop codon, 
=possibte nucieonac deieaon. vtjwbwib 

mcleotide insertion . — 

^AEGISCPSSEGAVGTTEAVUyASKNLLSiib 














DPBVGTmSPSLKELDPNQPLESKERRASDG 

OWfQSYMQLIQQKSAVEYAQSQMSLVSMC^ 
DLSSmSEPRMEWKVKIRSIX}TOTOKR^ 
DRLLRERALKIREERSGMrTDDDA\^>^ 
GRyV^KEERICQHLVKAKEQRRIUlEFMK«SR 

J^^TWMnQQJLTHGTXSPDGTRVYNSF 
{^?SSvSEAFVANUM^KILyVKV(jiaUA^ 


725 


2075 


A 


5707 


3 


1770 


dSd?wkgrfdflmscvgyaiglonvwrwy 
ottstoglgvwklapmfkgvglaaavlsfw 

^RCFSNYSM\WTOMrSAVVEFWraN 

Ktogi^kpgq^pl^^ 

^G^CIWWKUWSFFWnVAGVFIFSAVQ 
M^mMG^^rVFPI^WGQGVGWLMALSSMVL 
SAS--nJtOSLKQWQVMVQPSEDIV 
opHNfiPROPOAGSSTSKBAYI 


["726 


"2076 


A 


5711 


156 


423 


praqmOTIRKFEGQ>nCKFlIANARVQNCAn 
y'^pv.ijrpxrrr.PRRPDVMOKPCTCD 
I^Tp^^gV^FPIATIVUDVULVVlP 


[727 


T2077 


""A 


■ 5716 


"1 ~ 


"274 


rjCSSwVYmLVUOaFYIPLIHFKIRL 


728 


2078 


A 


5737 


1899 


649 


LPLSGGVDSAATACLlYSMCCyvi-tAviiwn 
m^VRTlVNQISrn'QDPRDLCQRILTrC 

fT^^^VMGn^LVTCKSPU'AAH^ 
S^S^IRMVLAYLFAQLSLWSRGVHGG 

SXsSANVDESIXGYLTKYDCSSADIOTIGG 

raamRAFVQFaQRFQLPALQSILLAPATAE 

iS^WDEEDMGMTYAELSVYGKL 

^vSo^F^CKLLGMWWBCTPRQVAD 

^^^WRHKMTTtTPAYHAENYSre 

S^LRPaYNTSWPWQFRCIENQVLQLER 

AEPOSLDGVD , 


729 


2079 


A 


5741 


1 


5976 


AACTVYLAAVNRLYQLSGANLSLEAEAAVG 

J5SsPIXMAP(WASCEHPmTONW!^ 
SpQQOLVVVCGSIYQGFCQLRRRGNKAV 
AA^AAPPAEPVTVFPSMLNVAANHPNAS 
i^^^AAOAGGSRIXVGATYTGYGSOT 
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wo 01/57188 



PCT/USOl/03800 



SEQID . 
NO: of 1 
nucl- 
eotide 
seq- 
uence 


SEQID 
m: of 1 
septide 
seq- 
uence 


Viet 

lod ^ 


SEQ 

[DNO: 

in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence {A=Alanine (^Cysteine. 
D=Aspartic Acid, EpGlutamic Acid, 
F=Phenylalanme, CKJlycinc, H=Histidine, 
I«lsolcucinc, K=Cysine, LfC^cihc 
M^MethiCHiine, N=Aspaiaginc P=Pn)line. 
Q=G!utamine, R«Arginine» S=»Scrinei 
T=Threonine, V=Valiiie» W=Tiyptophan, 
Y=Tyrosine, X«=Unknown, •^Stop codon, 
/possible nucleotide deletion, \<=possibie 
nucleotide insertion 














FDLNPSDDNXUOKQGAKEQHKLGFVSAfLHP 

SDPPPGAQSYAYLALNSEARAGDKESQARSL 

LARiaPHGACSGDAKKLTESYIQLGLQCAGG 

AGRGDLYSRLVSVFPARERLFAVFERPQGSPA 

ARAAPAALCAFRFADVRAAIRAARTACFVEP 

APDWAVLDSWQGTGPACERKLNIQLQPEQ 

LDCGAAHLQHPLSBLQPLKA1TVFRAPGLTSV 

AVASVNNYTAVFLGTVNGRLLKINLNESMQ 

WSRRWTVAYGEPVHHVMQFDPADSGYLY 

LMTSHQMARVKVAACNVHSTCGDCVGAAD 

AYCGWCALETRCTLQQDCmSSQQHFWTSA 

SEGPSRCPAMTVLPSEIDVRQEypGMILQISGS 

LPSLSGMEMACa^YGNNIRTVARVPGPAFGHQ 

lAYCNLLPRDQFPPFPPNQDHVTVEMSVRVN 

GRNIVKANFnYDCSRTAQVYPHTACTSCLSA 

QWPCFWCSQQHSCVSNQSRCEASPNPTSPQD 

CPRTLLSPLAPVPTGGSQNILVPLANTAFFQG 

AALECSFGLEEDFEAVWVNESWRCDQWUH 

TTRKSQVFPLSLQLKGRPARPLDSPEPMTVM 

VYNCAMOSPDCSQCLGREDLGHLCMWSDGC 

RLRGPLQPMAGTCPAPEIRMEPLSGPLDOGT 

LLTIRGRNLGRRLSDVAHGVWIGGVACEPLP 

DRYTVSEEIVCVTGPAPGPLSGWTVNASKE 

GKSRDRFSYVLPLVHSLEPTMGPKAGGTRITI 

HGNDLHVGSELQVLVNDTDPCTELMRTDTSI 

ACTMPBGALPAPVPVCVRFERRGCVHGNLTF 

WYMQNPVrrAISPRRSPVSGGRTITVAGERFH 

MVQNVSMAVHfflGREPTLCKVLNSTLITCPSP 

GALSNASAPVDFFINGRAYADEVAVAEELLD 

PEEAQRGSRFRLDYLPNPQFSTAKREKWKH 

HPGEPLTLVIHVSTKGAGKEQDSLGLQSHEY 

RVKIGQVSCDIQIVSDRIIHCSVNESLGAAVGQ 

LPITIQVGNFNQTIATLQLGGSETAnVSIVICSV 

LLLLSWALFVFCTKSRRAERYWQKTLLQME 

EMESQIREEIRKGFAELQTDMTDLTKELNRSQ 

OEPFLEYKHFVTRTFFPKCSSLYEERYVLPSQT 

LNSQGSSQAQETHPLLGEWKIPESCRPNMEE 

GISLFSSLLDNKHFLIVFVHALEQQKDFAVRD 

RCSLASLLTIALHGKLEYYTSIMKELLVDLID 

ASAAWKLMLRRTESVVEKMLTNWMSrCM 

YSCLRETVGEPFFLLLCADCQQINKGSIDAITG 

KARYTLNEEWLLRENIEAKPRNLNVSFQGCG 

iv/inQi Q\/i?AianTnTT>TOVKEKILEAFCKNVPY 

SQWPRAEDVDLEWFASSTQSYILRDLDDTSV 

VEDGRKKLNTLAHYKIPEGASLAMSLIDKKD 

NTLGRVKDLDTEKYFHLVLPTDELAEPKKSH 

R n<5HTi KK VLPErVLTRLLSTKGTLQKFLDDLF 

KAILSIREDKPPLAVKYFFDFLEEQAEKRGISD 

PDTLHIWKTNSLPLRFWVNILKNPQFVFDtDK 

TDHIDACLSV1AQAFIDACSISDLQLGKD6F1W 

KLLYAKEIPEYRKIVQRYYKQIQDMTPLSEQE 

MNAHLAEESRKYQNEFNTOVAMAEIYKYAK 

RYRPQIMAALEANPTARRTQLQHKFEQWAL 

MEDNIYECYSEA 


730 


2080 


A 


5744 


3 


" 292 


■ ■ QPSPLFHSHLETLQLLRTAQLPEQVSWF WUQ 
VANGKGNQRNMGSPQPSLLAFERNLELQIMG 
LGYSLLMGKLRPRVAKDTLRVHRDSTPSPLT 

LKD 


731 


2081 


A 


5747 


1 


382 


- FLKCMRKAFRSSKLLQVGYTPDGKDDYRWU 
FRVDEVNWTTWNTNVGflNEDPGNCEGVKRT 
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wo 01/57188 



PCTAJSOl/03800 



SEQH) 
I NO: of 
nucl- 
eotide 
seq- 
uence 



SEQID 
NO: of 
peptide 
seq- 
uence 



2082 



734 



735 



738 



Met 
hod 



SEQ 


Predicted 


BONO: 


beginning 


in 


nucleotide 


USSN 


location 


09/496 


correspondi 


914 


ng to first 




amino acid 




residue of 




peptide 




sequence 



2084 



2085 



2088 



5753 



5788 



5527 



198 



Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 



3 

2^ 



5870 



5881 



362 



1257 



268 



1160 



"Amino acid sequence (A»Aianine OCysteinc» 
I>Aspartic Acid, E?=Glutamic Acid, 
F=Phenylalanine, G=Glyciiie, H=Histldine, 
Msoleucine, K=Lysine, LHL^cine, 
M=Methionine, N=Asparagine, PMProlinc, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Tlireonine, V=Valine, W-Tiyptophan, 
Y=Tyrosinc, X«Unknown, *«Stop codoiu 



nucleotide insertion =mKrr?rTi?T7 

LSFSIJRSSRVSGRHWKNFALVPLLREASARD 



RQSAQPEEVYLRQFSGSLKPEDAEVFKSPAAS 
GEK 



AQAESSTVASPEATAGPLCTRlPNVFPmiKr 
PfiKLOAOLP CPSPVRFTSARIPPASRPQTKS 
AAGPPGLEAEGRAPESAGPGPGGDAAETFUL 
PPAHSGTLMMAFRDVTVQIANQNISVSSSTAL 
SVANCLGAQTVQAPAEPAAGKAEQGETSOR 
EAPEAPAVGREDASAEDSCAEAGASGAADG 
ATAPKTEEEEEEEETAEVGRGAEAEAGDLEQ 
LNRTSTSTKSAKSGSEASASASKDALQAMILS 
LPRYHCENPASCKSPTLSTDTLRKRLYRIGLN 
LFNINPDKGIQFUSRGFIPDTPIGVAHFLLQRK 
GLSRQMIOEFLGNSKKQFNRDVLDCWDEM 
DFSSMELDEALRKFQAHIRVQGEAQKVERUE 
AFSQRYCMCNPEWQQFHNPDTIFILAFAIILL 
IsJTDMYSPNIKPDRKMMLEDFIRNLRGVDDG 
ADIPRELWGIYERIQQKELKSNEDHVTYVTK 
VEKSIVGMKmSVPHRRLVCCSRLFEVTDV 
KKLQKQAAHQREVHJWIXVILKLCPKKKS 
SSTYTFCKSVGLLGMQFQLFENBYYSHGITLV 
TPLSGSEKKQVLHFCALGSDEMQKFVEDLKE 
SIAEVTELEQIRIEWELEKQQGTKTLSFKPCGA 
OGDPQSKQGSPTAKREAALRERPAESTVEVSI 
HNRLQTSQHNSGLGAERGAPVPPPDLQPSPPR 
OQTPPLPPPPPTPPOTLVQOQQIVKVIVLDKPC 
LARMEPLLSQALSCYTSSSSDSCGSTPLGGPG 
SPVKVTHQPPLPPPPPPYNHPHQFCPPGSLLH 

GHRYSSGSRSLV 



SSVMGDLVGQGLEEQIVAKDKNSWLUDOOIP 
IDDVMRVLDIDEFPQSGNYETIGGFMMFMLR 
KIPKRTDSVKFAGYKFEWDIDNYRIDQLLVT 

i?m<gif ATAT ^PKLPDAKDKEESVA 

* "^MNUl. 



MVFSAVLTAFHTGTSNl'lVV V ybNl 
PPPFQHPDLSPLLRYSFETMAPTGLSSLTVNST 
AVPTTPAAFKSLNLPLQITl^AIMIFILFVSFLG 
NLWCUS4VYQKAAMRSAINIIXASLAFADM 
LLAVLNMPFALVnLTTRWGKFFCRVSAMF 
FWLFVffiGVAJLLHSEDRFLDVQRQDKLOTYR 
AKVLIAVSWATSFCVAFPLAVGNPDLQIPSRA 
POCWGYTTNlPGYQAYVILrSLISFFIPFLVILY 
SmGILKTLRHNALRfflSYPEGlCLSQASKLGL 
MGLORPFQMSIDMGFKTRAFTTILILFAVFIVC 
WAPFTTYSLVATFSKHFYYQHNFFEISTWLL 
WLCYlJCSALOTUmVRIKKFHDACLDMMP 
ggPCTLPOUHjKIKRRIRPSAVYVCGEHRTVV 



FTRSDELARHYRTHTGEKRi''SCPLCPKQl-SKii 
DHLTKHARRHPTYHPDMIBYRGRRRTPRIDPP 

T T5;P.VESS A SGSGPGPAPSFTTCL 

LTWPOmJnLPEIXHMSKPAEDGPSPUALVK 

RSSSLGYISKAJBEYFLLKSRSDLMFEKQSERH 

GLARRLTTARRPPASSEQAQQELFNELKPAV 

DGAjmVhlHMRDQNNYNEEKDSWNRVART 

VDRLCli=VVTPVMWGTAWIFU^ 

OPFPGPPY SYNVQDKRFI 

LVVTArrAlLAFPNEYTRMSTSELISELiTMLPt^G 
LLDSSKLCDYENRFNTSKGGELPDRPAGVGV 
YSAMWQIjaTLniaVIlTFIFGMKIPSGLFIPS 
i;^^x/nATAom.T.nvnMEQLAYYHQEWrVFNS 
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PCTAJSOl/03800 



SEQID j b" 
KO:of K 
nucl- p 
cotide Si 
seq- « 
uence 


EQID N 
0:of h 
Bpttde 
5q- 
ence 


let S 
od E 
i 
I 

C 
S 


EQ F 
DNO: b 

JSSN 1 
>9/496 c 
>H r 
s 
I 

: 


redicted 1 f 
eginning n 
ucleotide | 1 
ocation c 
wrrespondi t 
ig to first ! 
iminoacid ( 
•esidue of J 
peptide 
sequence 


redicted end > 
ucleotide I 
ocation 1 
orrcsponding 1 
0 last amino i 
Lcid residue J 
)f peptide ' 
jequence 


Siino acid sequence (A=AiaMne 0=Cysteine, 
>Aspartic Acid, EFGlutamic Add, 
?=Phenylalanine, G=Glycine, H=Histidme, 
=lsoleucinc, K=Lysine, L-^Lcucinc, 
vl=Mcthioninc, N^Asparaginc, P=Prolinc, 
5=Glutamine, R=Arginine, S=Serinc, 
r^Threonine, V^Vaiine, W-Ttyptophan, 
VssTvrosine. X=Unknown, *=5top codon, 
possible nucleotide deletion, V=possible 

nucleotide insertion 

WCSQGADCITPGLYAMVOAAACLCiGV IRMT 














VSLVVIMFELTGGLEYlVPLMAAAMrSKWVA 
DALGREGlYDAHIRU^GYPFLEAKEEFAHm, 

A^^vMKPW^PLLmTQDS^m^DVBra 

SETTYSGFPVWSRESQRLVGFVLRRDLnSIE 
HAJUCKQDOWSTSIIYFreHSPPLPPYTPPTLK 
LRNILDLSPFIVrDLTPMEIVVDIFRKLGLRQC 
T ^rruxir'T?! T nHTKICDVLKHlAOMANQDPDSI 

1 FN 


739 


2089 


A 


5892 


2 


916 


TLOLAASVPFFAlSLlSWWLPESARWUINUltr 

DQWLRKVARmOHKEAKNLTffiVLNBW 

K^VASAKEPRSVLDLFCVPVLRWRSCiW^V 

,^TT^i^T T Towr-T \nn^T n^l fiRDTFLLOALFGA 
VNFSLLISYYGLVrJJLW^A-'*Ji^^*^*^*<'^* 

VDFLGRATTALLLSFLGRRTIQAGSQAMAGL 
AILANMLVPQDLQTLRWFAVLGKGCFGISL 
TCLTlYKAELFPl?VRMTADGtt.HTVGRLGA 
imGPLlLMSRQALPLLPPlXYGVISIA^V^ 
FFLPETQGLFLrJJl lyubDovir*^ i ^^^^ ^ 
Ainw^SIXEIVALHGAL 


740 


2090 


A 


5900 


2 1 


426 


-^RKlTXJIGFHFSVDOVHttigRbVyNLWlsJi^ 
>n.IILOTAKKHGYEVVDTFnTMGRy^HjQO 
KCGCHFHEWKSKLSKEYNFIKMKRSKNim^ 
ORYFSNQSKLQQGTVTNFRSPYHVRGPINQV 

■ RMPESTlJLllCE>jGYILEAPmil^Ui^"m>^ 


741 


2091 


A 


5910 


3 


412 


VSYEKDMCKCFHFSSVKSKILRliEIEKW^ 
ORELKEKIREERBNKLAAEMOEDGEKEFQ^ 
EEEKEEEEEEEEPLPEinPSTPSPlLaSFYSEPO 

T^FWV 


742 


2092 


A 


5936 


1 


1482 


MOCRLL(XVWCIJLQAOPUJlAVSQTPltYLV 
TQMGOTKSIKCBQNLGHMMW^QDSIOC 
FlWtFSVlWKEUINETVPNRPSPKSPDKAHL 

HKPPOSAWCLQaSVCTCTQOSSLHSLMASDO 

YPVC ^ 

■ MNSFFGTTAASWCLLESUVSSAPDlUiAUKW*. 


743 


2093 


A 


" 5938 


"1 


TT566 


RALSVQQRGOPAWSGSLEWSRQSAGDFRW- 

Hlsrqt^jcsswsrsrdrtcccrrawwi^ 

ADRAWpiMNEKWDTNSSENWIVWmJ 
DTiafflLYSDlNrryVNYYLHQPQVAAIFnSYF 
jjPpjrMMGNTWOTVMRNKHMHTVTNLn 
LNLAlSDLLVGIFCWraLDNIIAGWPFCTmi 
CKISGLVQGISVAASVFTLVAIAVDRFQCWY 
PFKPKlTKTAFVlIMnWVLArnMSPSAVWLH 

L^AVPHTGRKKQEQWHVVSRKKQKIIKMLU 
VAIXFILSWXJL^m-NmLSDYADLSPNlELQn 
ilnYTVPFAHWl^GNSSVNPIIYGFFNENFRRG 
FQEAFQLQLCQKRAKPMEAYALKAKSHVLDI 

tKvqesttqnphgetllyrksaekpqqe 

T VMKP.I.KRTTNSSEI 


744 


2094 


A 


5966 


149 


327 


- -SHVCVSHYAGSSGCPAUAUAGAVAJUUISAVA 

I vnvQGGRLGVARGAVmffiAPDIRQGDM _ 

- -GAPHTDWAWAPTPMSULOSURUKQOl LASS 


745 


2095 


A 


5970 


413 


856 


PLSLPUXAOVTGILATHJDQMARPAACN^ 
S^AIAWIMULVGLGFPFIMEALSHFLYW^ 
GVCVCGAIYTGUTJBTKGKTT^EISKEUKL 
MProRAQGPrWRSLEVIQSTEL 1 
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SEQID 
NO: of 
nucl- 
eotide 
seq- 
uence 



SEQlD 
NO: of 
peptide 
seq- 
uence 



746 



747 



748 



2096 



2097 



Met 
hod 



SEQ 
ID NO: 
in 

USSN 
09/496 
914 



5971 



2098 



Predicted 

beginning 

nucleotide 

location 

correspond! 

ng to first 

amino acid 

residue of 

peptide 

sequqice 

3 



5998 



Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 



1343 



JmiDO acid sequence CA=Aianine C=Cysteiner 
I>Aspartic,Aeid, EFGlutamic Acid, 
F^Phenylalanine, OKJlycine, HHEUsddine, 
I^Isoleudnc, K-Lysinc, L=Lcucine, 
M=Methiomne, N=Asparagine, P=Proline» 
QsGlutamine, R^Arginine, Ss^Scrine, 
T=Threonine, V-VaUne, W»Tiyptophan, 
Y=Tyrosine, X=Unknown, ♦«Stop codon, 
A=possible nucleotide deletion, Visible 
nucleotide insertion 



754 



6001 



AQTARRnGLELDTEGHRLFVAFSGaVYLFLS 

RCARHGACQRSCLASQDPYCGWHSSRGCVDI 

RGSGGTDVDQAGNQESMEHGDCQDGATGSQ 

SGPGDSAYGVRRDLPPASASRSVPIPLLLASV 

AAAFALGASVSGLLVSCACXRAHRRRGKDIE 

TPOLPRPLSLRSLARLHOGGPEPPPPSKDGDA 

VQTPQLYTTFLPPPEGVPPPELACLPTPESTPE 

LPVKHLRAAGDPWEWNQNRNNAKEGPGRSR 

GGHAAGGPAPRVLVRPPPPGCPGQAVEVTTL 

EELLRYLHGPQPPRKGAEPPAPLTSRALPPEP 

APALLGGPSPRPHECASPLRLDVPPEGRCASA 

PARPALSAPAPRLGVGGGRRLPFSGHRAPPAL 

LTRVPSGGPSRYSGGPGKHLLYLGRPEGYRG 

RALKRVDVEKPQLSLKPPLVGPSSRQAVPNG 

GRFNF . . 

DHASLPCSWNHRFDVKl'RHVnGDHSOQ>rn 

UaEQENCILVTTFRGHTOOVTALCWDPVQ 

RVLFSGSSDHSVIMWDIGGRKGTAIELQGHN 

DRVQALSYAQHTRQUSCXKSDGGIWWNMD 

VERQETPEWLDSDSCQKCDQPFFWNFKQMW 

DSKKIGLRQHHCRKCCHCAVCGKCSSKRSSIPL 

MGFEFEVRVCDSCHEAITDEERAPTATFHDSK 

HNIVHVHFDATRQWLLTSGTDKVIKLWDMT 

PWS 



747 



AMVFGGWPYVPQYRDIRRTQNADGFS lY V 
CLVLLVANIUULFWFGIUUFESPLLWQSAIMIL 
TKILU^KLCTEVRVANELNARRRSFTAADS 
KDEEVKVAPRRSFLDFDPHHFWQWSSFSDYV 
QCVLAFTGVAOYITYLSIDSALFVETLGFLAV 
LTEAMLGVPQLYRNHRHQSTEGMSIKMVLM 
WTSGDAFKTAYFLLKGAPLQFSVCGLLQVLV 
PLAILGQAYAPARHPQKPAPHAVHPTGTKAL 



5^ 



A 6002 



750 



2100 



447 



GRPDRSELVRMHILEETFAEPSLQATQMKLK 
RARLADDLNEKIAQRPGPMELVEKNILPVDSS 
VKEAnGVGKEDYPHTQGDFSFDEDSSDALSP 
DQPASQESQGSAASPSEPKVSESPSPVTTNTP 
AQFASVSPTVPEFLKTPPTAD 



A 6004 



427 



6007 



33 



1280 



LLTQAMLVLlPHRPQWFTPGPRLQAQGPCQEG 
WRWELRLRNYVPEDEDLNKRRVPQAKPDAV 
QEKVKEQLEAAKPEPVIEEVDLAKLAPRKPD 
WDLKRDVAKKLEKLLKRTQRAIAELIRERLK 
GQEDSLDSAVDAATEHKTC 



TDQAKVDNQPEKLVRSAEDVSTVPTQPDNPl- 
SHPDKLKRMSKSVPAFLQDESDDRETDTASE 
SSYQLSRHKKSPSSLTNLSSSSGMTSLSSVSGS 
VMSVYSGDFGNLEVKGNIQFAIEYVESLKEL 
HVFVAQCKDLAAADVKKQRSDPYVKAYLLP 
DKGKMGKKKTLWKKTLNPVYNEILRYKIEK 
QILKTQKLNLSIWHRDTFKRNSFLGEVELDLE 
TWDWDNKQNKQLRWYPLKRKTAPVALEAE 
NROEMKl^QYWEPWGKKLPTTGEVHIWV 
KECU5IJP1XRGSHLNSFVKCTILPDTSRKSRQ 
KTRAVGKTTNPIFNHTMVYDGFRPEDLIVIEAC 
mXVWDHYKLTNQFLGGLRIGFGTGKSYGT 
EVDWMDSTSEEVALWEKMVNSPNTWIEATL 

PLRMLLIAKISK - 

KEIFSPFEUSVKPLCLLLGVTCSQSMAFEELL 



752 



2102 



6028 



108 



1283 



SQVGGLGRFQMLHLVFILPSLND-LIPHILLENF 
AAAIPGHRCWVHMLPNNTGSGNETGILSBPA 
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wo 01/57188 



PCT/USOl/03800 



SEQID i 
NO: of > 
nucl- F 
eotide s 
scq- I 
uence 


EQID ^ 
10: of t 
leptide 
fiq- 
lence 


Aei 5 
lod I 
i 
I 
( 


»EQ I 
DNO: I 
Q r 
JSSN 1 
)9/496 ( 
)14 1 
1 
1 


'redicted 1 ] 
wginning i 
lucleotide 
ocation 
»nespondi 
ig to first 
unino acid 
residue of 
peptide 
sequence 


Predicted end i 
lucleotide 
ocation 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Smino acid sequence (A«Aianinc OCTsteme, 
[>Aspartic Acid, E^Glutamic Acid, 
F=Phenylalanine, G=Glycine, H=Histidine, 
I=lsoleucine» K=Lysine, L=Uucine, 
M=Methionine, N='Asparagine, P^Proline, 
Q=Glutaniine, R=Arg;inine, S=Serine, 
T=Threonine, V^Valine, W=Tryptoplian, 
Y^Tyrosine, X^Unknown, *=Stop codon, 

/— nncciVilf> niirlpnHfle deletion. V^DOSSlblc 
nucleotide insertion 














LLRISIPLDSNLRPEKCRRFVHFQWQLLHLNU 

TIHSTSEADTEPCVDGWVYDQSYFPSTIVTKW 

DLVCDYQSLKSWQFLLLTGMLVQGHGGHV 

SDRFGRRnLRWGLLQLATTDTCAAFAPTFPV 

YCVLRFLAGFSSMinSNNSLPITEWIRPNSKAL 

WILSSGALNIGQIILGGIAYVFRDWQTLHVV 

ASVPFFVFFLLSRWLVESARWLnTNKLDEGL 

KALRKVARTNGDCNAEETLNIEWRSTMQEE 

LDAAQTKTTVWDLFRNPSMRKRICILVFLRK 

KNLKEKA 


753 


2103 


A 


6043 


1 


1470 


DSFESDLRLIFEIHHSGEKGDIWFLACEQDIEK 

VCETVYQGSNLNPDLGELVWPLYPKEKCSL 

FKPLDETEKRCQVYQRRWLTTSSGEFLIWSN 

SVRFVIDVGVERRKVYNPRIRANSLVMQPISQ 

SQAEIRKQILGSSSSGKFFCLY ibbl* AbJSJ^ivi i r 

LKPAEMQEANLTSMVLFMKRIDIAGLGHCDF 

MNRPAPESLMQALEDLDYLAALDNDGNLSE 

FGIIMSEFPLDPQLSKSILASCEFDCVDEVLTIA 

AMVTAPNCFSHVPHGAEEAALTCWKTFLHPE 

GDHFTLISIYKAYQDTTLNSSSEYt^ViJis.wvjvu 

YFLNCSALRMADVIRAELLEIIKRIELPYAEPA 

FGSKENTLNKKALLSGYFMQIARDVDGSGN 

YU^THKQVAQLHPLSGYSITCKMPEWja,^^ 

HKFSISENNYIRITSEISPELFMQLVPQYYFSNL 

PPSESKDILQQWDHLSPVSTMNKEQQMCET 

rPETEORCTLO 




2104 


A 


6055 


2 


394 


YYAUfflWPFPDLLCQTTOAIFQMNMYUSCIF 
LMLINVDRYAAIVHPLRLRHLRRFRVARLLC 
LGVWAULVFAVPAARVHRPSRCRYRDLEVR 
LCFESFSDELWKGRIii>LVLLAEALGFLLPLA 


755 


2105 . 


A 


6059 


3 


1795 


LGLGSGTLLSVSEYKKKYRHHVLQLHARVK^ 
RNARSVKTIXRFTKLLIAPESAAPEEALGFAEE 
PEPGRARRSDTHTFNRLFRRDEEGRRPLTWL 
QGPAGlGKTMAAKKH-YDWAAGKLYQGQVp 
FAFFMPCGELLERPGTRSLADLILDQCPDRGA 
PWQMLAQPQRIiJnLDGADELPALGGPEAAP 
CTDPFEAASGARVLGGLLSKALLFTALLLVTT 
RAAAPGRLQGRLCSPQCAEVRGFSDKDKKK 

« n^ivmTO nT?DD A T3D A VH PVK FNETLF ALCFV 

YFYKFFRIJiiWlAJiKA i tsr v^jii^ti i i^i n^*-** 

PFVCWIVCTVLRQQLELGRDLSRTSKTTTSVY 

LLFITSVLSSAPVADGPRLQGDLRNLCRLARE 

GVLGRRAQFAEKELEQLELRGSKVQTLFLSK 

KELPOVLETEVTYQFIDQSFQEFLAALSYLLE 

T-»i"m/PPTA AfifiVGTLLRGDAOPHSHLVLTT 

RFLFGLLSAERMRDIERHFGCIVIVSERVKQEA 

LRWVQGQGQGCPGVAPEVTEGAKGLEDTEE 

PEEEEEGEEPNYPLELLYaYETQEDAFVRQA 

LCRFPELAlA2RVRrCRMi>VAVi*a i i^v jvvA^rr* 

GQALRLISCRLVAAQEICKKKSLOKRLQASLO 


r75i 


-■2106 


A 


6060 


12 


436 


" ^^FfRPAKPTGQGMGR^mTLVCQUSlMMS 
ARDLIMNNLTELQPGLFHHLRFLEELRLSGNH 
LSHIPGQAFSGLYSIiaLMLHNNQLGGIPAQA 
LWELPSLQSLRLDANLISLVPERSFEGLSSLRH 

l.WLDDNALTEIPS 


757 


2107 


A 


6063 


54 


419 


- TTTLGLGAADMCAbPWUiXLLLQEGSyKKi. 
WRWCGSEEWAVU3ESlSLPLEIPPDEEVENn 
WSSHKSIAT\nWKEGHPATTMVTNPHYQG 
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SEQID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A^Aianinc C=Cystein^ 
D=Aspartic Acid, EF=Glutamic Acid, 
F'^enylalanine, G^Glycine, H=Histidine, 
T—icnim trine K^Lvsinc. LHLcucinCi 
M=Methionine, N=Asparagiiie. P^Prolin^ 
Q=Glutamine, R=Arginine, S=Serine, 
T«Threonine, V=Valine, W^Tryptophan, 
Y^Tyrosine, X=Unlaiown, ♦=Stop codon, 
/^possible nucleotide deletion, V=possible 
nucleotide insertion 

r\n TKyfT T DC! HTlP^ A^WPRHPSSSCSW 


758 


2108 


A 


6066 


125 


438 


IGISCPATDFVPMFSHSLIGIGEEYQLPYYNMV 
PSDPSYEDMREWCVKRLRPIVSNRWNSDEC 
LRAVLKLMSECWAHNPASRLTALRKKTLAK 
MVESODVKI 


759 


2109 


A 


6072 


3 


650 


PGRRFRPAALEERAMEKLRKKVPFQNRGKGT 

LSSIIPNNSDTRKATETTSLSSKPEYVNPDFRW 

SKDPSSKSGNLLETSEVGWTSNPEELDPIRLA 

LLGKSGLSCQVGSATSHPVSCQEPlDEDQKlbr 

KDKSTAGRBFSGQVSHQTTSENQCTPIPSSTV 

HSSVADMQNMPAAVHALLTQPSLSAAPFAQ 

RYLGTLPSTGSTTLPQCHAGNATVW 


7oU 






6077 


3 


730 


PLRLTLMEEVLLLGLKDREGYTSFWNDGISSU 

LRGCMLIELPLRGRLQLEACGMRRKSLLTRK 

VICKSDAPTGDVLLDEALKHVKETQPPETVQ 

NWIELLSGETWNPLKLHYQLRNVRERLAKNL 

VEKGVLTTEKQNFlJ-FDNfnWLTONNlKQR 

LIKKVQEA\aJ)KWVNDPHRMDRRLLALIYL 

AHASDVLENAFAPLLDEQYDLATKRVRQLLD 

LDPEVECLKANTNEVLWAWAAFTK 


761 


2111 


A 


6078 


833 


390 


IVSFHLSGFKKFVRPFSFLSVHGLQVDEYH5V 
HQKLSADMADHSNLIRSIXVGAEDARJLMRD 
MKTMKSRYMELYDLNRDLLNGYKIRWNNH 
TELLGNLKAVNQAIQRAGRLRVGKPKNQVrr 
ACRDAIRSNNINTLFKIMRVGTASS 


762 


2112 


A 


6079 


2 


2686 


KKAITCGEKEKQDLDCSLAMLKDGFRTDRGb 
HSDLWSSSSSLESSSFPLPKQYLDVSSQTDISG 
SFGINSNNQLAEKVRLRLRYEEAKRRIANLKI 
QUlKLDSEAWPGVLDSERDRLILINEKEELLK 

emrfisprkwtqgeveqlemarkrlekdlq 
aarettqskalterlklnskrnqlvreleeat 

RQVATLHSQLKSLSSSMQSLSSGSSPOaTSSR 

gslvassldsstsasftdlyydpfeqldselq 

SKVEFLLLEGATGFRPSGCrrriHEDEVAKTQ 

kaegggrlqalrslsgtpksmtslsprsslss 
psppcsplmadpllagdaflnslefedfelsa 
tlcelslgnsaqeryrleepgteokqlgqav 
ntaqgcglkvacvsaavsdesvaodsovye 

ASVQRLGASEAAAFDSDESEAVOAraiQIALK 
YDEKNKQFAILnQLSNLSALLQQQDQKVNIR 

vavlpcsesttclfrtrpldasdtlvfnevfw 
vsmsypalhqktulvdvcrtdrshleeclgg 

A0ISLAEVCRSGER5TKW YNLi*a rR.rL.ivR.vo 

relkpvgvmapasopastdavsalleqtave 
lekrqegrsstqtledswryeetseneavae 
eeeeeveeeegeedvftekaspdmdgypalk 
vdketntetpapsptwrpkdrrvgtpsqgpf 

LRGSniRSKTFSPOPQSQYVCRLNRSDSDSST 
LSKKPPFVBNSLERRSVRMKRPSPPPQPSSVK 
SLRSERLIRTSLDLELDLQATRTWHSQLTQEIS 
VLKELKEQLEQAKSHGEKELPQWLREDBRFR 
LLLRMLEKRMDRAEHMGELQTDKMMRAAA 
KDVHRLRGQSCKEPPEVQSFREKMAFFTRPR 
MNIPALSADDV 


763 


2113 


A 


6082 


3 


1558 


' PHPIRFSKLCVSFNNQEYNQFCVIEEASKAWii 
\a.ENLTQGmCLVPGKTRKLLFKFVAKTED 
VGKKIEITSVDLALGNETGRCWLNWQGGGG 
DAASSQEALQAARSFKRRWOPDNEVHWGSn 
lOASTNmSRWNISVHUiffiPPALimiYCLV 
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SEQ ID J 
NO: of 1 
nucl- I 
eotide : 
seq- 1 
uenoe 


SEQ ID 1 
^0: of I 
peptide 
seq- 
uence 


Viet \ 
lod 1 

i 
1 
( 


SEQ 1 
DNO: 1 
n 

JSSN 
[)9/496 
914 


Predicted 
seginning i 
lucleotide 
location 
correspondi 
ng to first 
amino acid 
residue of 
peptide 
sequence 


f>redictcd end 
tiucleotidc 
ocation 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


f\mino acid sequence (A-Alanine OCystein^ 
>=Aspartic Acid, &<}lutamic Acid, 
?=Phenylalamne, G=Glycine, H=Histidine, 
[=Isoleucine, K=Lysine, L?=Leucine, 
M[=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Aiginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y«Tyrosine, X^Unknown, *«Stop codon, 
/==possible nucleotide deletion, >r^ssible 
nucleotide insertion 

Ti^if\cn rcvTnnirwjvi TAm If PfrOnANLTQIC 














V^l VQSHEKTvlRL)VKL 1 AUJUKrvjvA-'^^^'' x*^ 

THVTLHGTELCDESYPALLTDIPVGDLHPGEQ 

LEKMLYVRCGTVGSRMFLVYVSYLINTTVEE 

KEIVCKCHKDETVTTETWPFDVAVKFVSTKF 

EHLERVYADIPFLLMTDlXSASPWALTrVSSE 

LHLAPSlvrrrVDQLESQVDNVILQTGESASECF 

CLQCPSLGNIEGGVATGHYnSWKRTSAMENI 

PnTTVITLPHVlVENIPLHVNADLPSFGRVRES 

LPVKYHLQNKTDLVQDVEISVEPSDAFMFSG 

LKQIRLRILPGTEQEMLYNFYPLMAGYQQLPS 

lninllrfpn™qlliotiptsifvkpqgrlm 
ddtsiaaa 


764 


2114 


A 


6093 


1 


1422 


AAADLANSNAGAAVGRKAGPRSPPSAPAFAl' 

PPPAPAPPTLGNNHQESPGWRCCRPTLRERN 

AL^!F^mELMADVHFVVGPPGATRTVPAHKY 

VLAVGSSVFYAMFYGDLAEVKSEIHIPDVEPA 

AFLILLKYMYSDEIDLEADTVLATLYAAKKYI 

VPALAKACVNFLETSLEAKNACVLLSQSRLF 

EEPELTQRCWEVIDAQ AEMALRSEGFCEIDR * 

QTLEIIVTREALNTKEAWFEAVLNWAEAEC 

KRQGLPITPRNKRHVLGRALYLVRIPTMTLEE 

FANGAAQSDILTLEETHSIFLWYTATNKPRLD 

FPLTKRKGLAPQRCHRFQSSAYRSNQWRYRG 

RCDSIQFAVDRRVFIAGLGLYGSSSGKAEYSV 

KIELKRLGVVLAQNLTKFMSDGSSNTFPVWF 

EHPVQVEQDTFYTASAVLDGSELSYFGQEGM 

TEVQCGKVAFQFQCSSDSTNGTGVQGGQIPE 

LTFYA 


765 




A 


6099 


1 


1150 


SGFTHYAIYDFlVKGSCFCNVHADQCirVHur 

RPVKAPGTFHIvrVHGKCMCKHhrrAGSllLI^H 

CAPLYNDRPWEAADGKTGAPNECRTCXCNG 

HADTCHFDVNVWEASGNRSGGVCDDCQHN 

TEGQYCQRCKPGFYRDLRRPFSAPDAOCPCS 

CHPVGSAVLPANSVTFCDPSNGDCPCKPGVA 

GRRCDRCNTVGYWGFGDYGCIux^JJt-Auat-u 

PITGDCISSHTDIDWYHEVPDFRPVHNKSEPP 

WEWEDAQGFSALLHSGKCECKEQTLGNAKA 

FCGNfiCYSYVLKIKILSAHDKGTHVEVNVKIK 

KVLKSTKLKIFRGKRTLYPESWTDROCTCPIL 

NPGLEYLVAGHEDIRTGKUVNMKSFVQHWK 

P SLGRICVMDILKRECBt 


766 


2116 


A 


6103 


2 


384 


" l^TAAATATVUCEGVLEkRSOGIXQLWKKMt 
CVLTERGLQLFEAKGTGGRPKELSFARIKAVE 

/^irooT/:ioiiIVPTl VTPfMinFIDFRCPLEDPGW 

NAQITLOLVKFKNQQAIQTVRARQSLOTOTL 
vs 


767 


\ 2117 


A 


6106 


1 


542 


' SGSSHASDGSGFQELRICSEDQTPUAUMU5i.r 
MARYYIIKYADQKALYTRDGQLLVGDPVAD 
NCCAEKICTLPNRGLDRTKVPIFLOIQGOSRC 
LACVETEBGPSLQLEDVNIEELYKGGEEATRF 
TFFQSSSGSAFRLEAAAWPGWFLCGPAEPQQ 
PVOLTKESEPSARTJCFYFEOSW 


768 


2118 


A 


6109 


3 


292 


- FILQAVLQLSSQEARYKAFG'rcVSHlUAlLAF 
YTPSVISSVMHRVARCAAPHYfflLLANFYLLF 
PPT^A^in'GVKmOIRDSLGSIPEKGCVNRE 


769 


2119 


A 


6110 


1 


711 


- RHEPSCSNGVASlKSkQNHSKYPAPSSSSSSS 
SSSSSSSPSSVNYSESNSTDSTKSQHHSSTSNQ 
ETSDSEMEMEAEHYPNGVLGSMSTWVNGAY 
KHEDLQTOESSMDDRHPRROLCGGNOAATE 
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SEQID 5 
NO: of 1 
nucl- I 
eotide £ 
seq- I 
uence 


rEQID f 
sIO: of \ 
)Cptide 
>eq- 
jence 


lod I 
i 
1 
( 

{ 


5EQ 1 
DNO: 1 
n 1 
UfSSN 
}9/496 
914 


Predicted J 

jeginning 

lucleotidc 

ocatioD 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


'redicted end 
nucleotide 
ocation 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A^Aianine OCystelnc, 
D=»Aspartic Acid, E^Glutamic Acid, 
F'^'henylalaninc, O^Olycine, H=»HistidinCj 
[=lsoleucine, K=Lysine, LFLeucine, 
M=Metiiionine, N^AsparagJne, P*=Pioline, 
QMjlutamine, R=Arginine, S^Serine, 
T=Threonine, V=Valine, W=Tiyptophan, 
Y«Tyrosine, X^Unknown, *=Stop codon, 
/^possible nucleotide deletion, V^ssible 
nucleotide Insertion 














RULFGRELQALSEQLGREYGKNLAtfTEMLQD 
AFSLLAVSDPWSCPVGQQLDPIQREPVCAAL 
NSAILESQNLPKQPPLMLALGQASECLRLMA 
RAGLGSCSFARVDDYLH 


770 


2120 


A 


6125 


2 


570 


~\rxtn.\ >JT MvnWT (l>OJVFLLOTLFGAVlLLANC 
VAPWALKYMNRRASQMLLMFLLAICLLAllF 
VPQEMQMLREVLATLGLGASALANTLAFAH 
GNEVIPrnRARAMGINATFANIAGALAPLMNfl 
ILSVYSPPLPWnYGVFPFISGFAFLLLPETRNK 
PLFDTIODEKNERKDPREPKQEDPRVEVTQF 


771 


2121 


A 


6126 


909 


353 


-RSFVLDTASAICNYNAHVKNHPKYWCROYb 

RDYCNIIAFSPNSTNHVALRDTGNQLIVTMSC 

LTKEDTGWYWCGIQRDFARDDMDFTELIVT 

^^-r^Trr^Tn A vTrM7ii7or'i(mf cnNlfTBSCICAPKVV 
DDKGTLANDFWoujsi^i^ounA.iKDvi^-r^ ^ 

RKADRSRTSILnCILrrGLGnSVISHLTKRRRS 

QRNRRVGNTLKPFSRVLTPKEMAPTEQM 


772 


2122 


A 


614S 


7 


810 


FVLGILALSHTlSPFMNKFFPASFPNRQYgLl-i' 

TQGSGENKEEUNYEFD1KDLVCLGLSSIVGV 

WYLUUCHWIANNU'GLAFSLNGVEIXHLNN 

VSTGCn-LGGLFIYDVFWWQTNVMVTVAKS 

FEAPKLVFPQDLLEKGLEANNFAMLGLGDV 

VIPGIFlAUXRFDISLKKKnnTFYTSFAAYIF 

GLGLTIHMHIFKHAQPALLYLVPACIGFPVLV 

ALAKGEVTEMFSYEESNPKDPAAVTESKEGT 

RASASKGLEKKEK 


773 


2123 


A 


6161 


3 


1088 


FTFLLYKFLKESAOEPLFMLYCADCHQMEKG 

PIDAITGEARYSLSEDKLIRHUDYKTLTLNCV 

NPENENAPEVPVKGLDCDTGTQAJCEKIXDA 

AYKGVPYSQRPKAADMDLEWRQGRMARITL 

ODEDVTTKIDNDWKRLNTLAHYQVTDGSSV 

ALVPKQTSAYNISNSSTFTKSLSRYESMLRTA 

SSPDSIJCSRTPMTIPDLESGTKLWHLVKNHDH 

T T^rkDCrtrfcpawMVSFTYLTRLLATKGTLOKF 

VDDLFETIFSTAHRGSALPLAIKYMFDFLDEQ 

ADKHQIHDADVRHTWKSNCLPLRFWVNVIK 

>3PQFVFDIHKNSrrDACLSW 


774 


2124 


A 


6163 


860 


125 


- KTAVKKRNLNPVFNETLRYSVPQAELQGRVL 
QT WWHRFSLGRNIFLGEVEVPLDTWDWGSE 
PTWLPLQPRVPPSPDDLPSRGLLALSLKYVPA 
GSEGAGLPPSGELHFWVKEARDLLPLRAGSL 
DTYVQCFVLPDDSRASRQRTRWRRSLSPVF 
NHTMVYDGFGPADLRQACAELSLWDHGALA 
NRQLQGTRLSLGTGSSYGLQVPWMDSTPEEK 
pLWOALLEOPCEWVDGLLPLRTNLAPRT 


775 


2125 


A 


6191 


2 


" 392 


" ■ARGIGSLGRDHSGSGGGI'GMAGAWVRKAAU 
YVRSKDFRDYLMSTHFWGPVANWGLPIAAIT 
DMK\KSPmiSRRMTTAL*CYSLTFVRFAHYVQ 
\PWNWUvlLOCHTAVDFDQLISbMrUiollulvi x 
ASASAL 


776 


2126 


A 


6217 


1 


827 


- FRGYWGVREAHBASWiiGGLGPGlU'GMKJi 
RQKHAKKHLGFFRNNFGVREPYQILLDGTFC 
OAALRGRIQLREQLPRVLMGETQLCTTRCVL 
KELETLGKDLYGAKLIAQKCQVRNCPHFKNA 
VSGSECLLSMVEEGNPHHYFVATQDQNLSVK 
VKKKPGVPIJStfFnQKrMVU)ia*SPKTIAFV^ 
VESG\RLSQCMRKKVSNISKR>IRV**KTLNRG 
RRKKRKKISGPNPLSaKKKKKAPDTQSSASE 
KKRia«RIRNRSNPK\nLSEKONAEGE 



239 



wo 01/57188 



SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


"SEQ 

IDNO: 

in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

Qg to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Aianme OCysteme. 
D=Aspartic Acid, EHjlutamic Acid, 
F— Pncnylaianine, u^vjiycmc, n i:usuuui«i» 
I=Isoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proliiie, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threoninc, V=Valine, W=Tiyptophan, 
Y«Tyrosine, X«Unknown, ♦«Stop codon, 
/=T)0SSiDie nucieouac acieuoui rTJwasiuic 
nucleotide insertion 


777 


2127 


A 


6236 


1038 


1402 


YYQ1SSLPSIVGNGIFLWLL1C1FLAKQGUSKL' 
FQPFGRPRGGGHLRSGVLGQPGQHGETP/SFF 
YNSKISPALWGPPVIPSALGGEAGKSL*PRRQ 
RFORGGl/^LPSRVRGKAKJ^rLJsJsJv 


778 


2128 


A 


6237 


422 


913 


ASFFHHHRGAFLLLLAffGS*GQDQSLIHWSN 
AVSNAD\LLDLKVN*LDH\LEEKMPLVEVKVVP 
PQVL\SEPN*RSGGCFSAPSFEVPPWTGEVKP/ 
SPQRDGGALG\QGPLGffSDSILALLKKQT*RA 
LLNWPLOSLRRSSCaFQOQDGQDLKPRSGLGC 

NSFRYRR 


779 


2129 


A 


6249 


420 


36 


ARAPSPSFSVRDVELSDPARERGEMFVAVUl' 
YGQSQPSCFDRVKMGFVMGCAVGMAAGAL 
FGTFSCLSSILVSSSG/SGMRGRELMGGIGKTM 
MQSGGTFGTFMAIGMGIRC*PWLPTTSVPSH 

OSOPMY 


780 


2130 


A 


6263 


415 


1380 


" RJlsmdCDRGIQMU rrVUAFAAFSLMTIAVU 
TDYWLYSRGVCRTKSTSDNETSRKNEEVMT 
HSGLWRTCCLEGAFRGVCKKIDHFPEDADYE 
QDTAEYLLRAVRASSVFPILSVTIXFFGGLCV 
AASEFHRSRHNVILSAGIFFVSAGLSNnGIIVYI 
SV^NAGRTPGQRNDSKKSYSYGWSF/YFSGAFS 
FnGR/IIC»GVGLPWHIYIEKHQQLRAKSHSEF 
UCKSTFARLPPYRYRFRRRSSSRSTEPRSRDLS 
PISKGFOTIPSTDISMFTLSRDPSKrrMGTLLNS 
DRDHAFLQFHNSTPKJ3FKESLHNNPANRRTT 

PV 


781 


2131 


A 


6274 


S32 


318 


-OK^^aDLKQTLAlK'lAYPRCKCLVEMDQl^H 
LOVKQKQLACLCTWQARDPDCPPSTKVVLA- 
VGPGMGCMVALFQDSIAWSNKSMPSSLSAIS 
OSPCQVQAPEGPSSFHLPXLSFTTCLSWQGGD 
LEFLGDLKGCSELKNFQELITQSALVHPKADV 
WWYCORPLLGTLPSN 


782 


2132 


A 


6281 


1324 


393 


WISLPSSLLCRKNGSSAEDDRRVGEPSAEEAEG 

.EREDWGIGSA»SVGAVSKVPSARF*RTYPS\E 

DEEEVTHQKSSSSDSNSEEHRKKKTSRSRNK 

KKRKNKSSKRKHRKYSDSDSNSESDTKSDSD 

DDKKRVKAKKKKKKKKHKl^^ 

ESSDSSCKDSEEDLSEATWMEQPNVADTMDL 

IGPEAPnHTSQDEKPLKYGHALLPGEOAAMA 

EYVKAGKRIPRRGEIGLlsiiiiiucfcrii^oiJi vwi 

SGSRHRRMEAVRLRKENQIYSADBKRALASF 

NQEERRKRESKIIASFREMVHKKTKGKDDK 


783 


2133 


A 


6305 


201 


1032 


' "WDDYPQGALRRREAAEOLHFLGPPGRVRU^^ 
LRGITGPAWYCHSPSHSLLSAFCHUPTPSRCP 
AMARPPVPGSVWPNWHES/RRGQGVFGLHS 
AQEPPAGVWAA*AASAAAA\LSn>TASYKIFV 
SGKSGVGKTALVAKLAGLEVPWHHETTGIQ 
TTVVFWPAKLQASSRVVKffRFEFWDCGESA 
LKKFDHMLLACMENTDAFLFLFSFTDRASFE 
DLPGQLARIAGEAPGWRMVIGSKFDQYMHT 
nVPERDLTAFRQAWELPLLRVKSVPGRRLG 


784 


2134 


A 


6308 


86 


96 


- GSSPDPASLITMKNQDKKNGAAKQSNPKSbP 
GOPEAGPEGAQERPSQAAPAVEAEGPGSSQA 
PRKPEGAQARTAQSGALRDVSEELSRQLEDIL 
STYCVDNNQGGPGEDGAQGEPAEPEDAEKSR 
TYVARNGEPEPTPWNGEKEPSKGDPNTEEIR 
OSDEVGDRDHRRPQEKKKAKGLGKEITLLM 
OUNaSTPEEKIJ>kALCKKYAELLEEHRNSQ 
KONDCLLQKKOSOLVOEKDHLRGEHSKAVLA 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Aianinc CMJysteine^ 
I>»Aspartic Acid, EKiIutamic Acid, 
F^Phenylalanine, GOlycine, H»Histidine» 
I=Isoleucine. K=Lysine, I/=Leucine, 
M=Methionine. N=Asparagine, P=ProIine. 
QsOlutamine, R=Argininc, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y'^Tyrosine, X=Unknown, *'*Stop codon, 
/=T50Ssible nucleotide deletion, \p^ssiblc 
nucleotide insertion 














RSKLESLCRELQRHNRSLKEEGVQRAREEEE 

KRKEVTSHFQVTLNDIQLQMEQHNERNSKLR 

QENMELAERLKKLIEQYELREEHIDKVFKHK 

DLQQQLVDAKLQQAQEMLKEAEERHQREKD 

I^LKEAVESQRMCELMKQQETHLKQQLALY 

TEKFEEFQNTLSKSSEVFTTFKQEMEKMTKKI 

KJXEKETTMYRSRWESSNKALLEMAEEKTV 

RDKELEOLQVKIQRLEKLCRALQT/GAQ*PVR 

GORWGSHRTSAVRIFS 


785 


2135 


A 


6319 


1493 


889 


SPQGPLLRSVSPVSAGASSVTPCiUAgi'UVri 1 

PPSLVAVAPAPGSAAGPAAGWQ*HAGCRAVT 

KLPWSWGMRPMKIFFSEEYRSISTRISHDAL* 

EKCTQPAKPLSMIRNTGSSVSPG/PLVKWNWT 

RREFRNSGTRWSSCCGMSCMYSFLGHCSV/S 

QDLPLVHVDVGWQPPLGPTVGLRPOLLPLHD 

TTPCOKLWDDLDWA 


786 


2136 


A 


6320 


531 


135 


RWLPVAECDSSCVGCTGEGPGNCKECISGYA 

REHGQCADVDECSLAEKTCVRKNENCYNTP 

GSYVCVCPDGFEET/RRCLCAAGRG*SHRRRK 

PDTAALPRRPVMCRTYPLhfySEGCPVENVAL 

RMPSPAVDSGGERLPAL 


787 


2137 


A 


6330 


1693 


227 


DYVLTAELHRQRSPGVSFGLSVFNLMNAIMU 

SGILGLAYVMANTGVFGFSFLLLTVALLASYS 

VHLLLSMCIQTAYLGP*TNYFMVLPAH*LTCL 

PLIEFLQSL*NSL\*AVTSYEDLGLFAFGLPGKL 

WAGTIUQNIGAMSSYLLmCTELPAAIAEFLT 

GDYSRYWYLDGQTLLIUCVQIVFPLALLPKIG 

FLGYTSSLSFFFMMFFALVVnKKWSIPCPLIL 

im^KGFQISNVTDDCKPKLFHFSKESAYALP 

TMAFSFLCHTSILPIYCELQSPSKKRMQNVm 

TAIALSFLIYFISALFGYLTFYD/GTTKAQRGE 

VTCHRIKDKVESELLKG***IP*SHDVVV\mV 

KLCIIJAVLL\mLIHFPARKAVTMMFFSNFP 

FSWIRHFLITLALNinVLUIYVPDIRNS^ 

GASTSTCUFEFPGLFYLKLSREDFLSWKKLGV 

GCFCAXSFKTSILRNSLSVYnLPASRKSIYFKI 


788 


2138 


A 


6351 


1 


-s?s 


prslcfslwaeAAvLad(Sglrrrrrllrutm 

sasfvpngasledchcnlfcladltoikwkk 

yvwqgptsapilfmeedpilssfsra.kadv 

lga^wrrdqrperre\l*ifwogedp\vlltlf 

tmtyqkkkmecgrmdfpmnavlcafskavh 

NLLERCLMNRNFVRIGKWFVKPYBKDEKPIN 
KSEHLSCSFTFFLHGDSNVCTSYEINQHQPVY 

llseehttlaqqsnspfqvilcpfolngtltgq 

afkmsdsatkkligewkqfypiscclkemse 

ekqedmdweddslaavevlvagvrmiypac 

fvlvpqsdiptpspvgsthcsssclgvhqvpas 

trdpamssvtltpptspeevqtvdpqsvqkw 

vkfssvsdgfnsdstshhggkiprklanhw 

drvwqeo^imnraqnkrkysassoolceeat 

aakvaswdfveatqrtncsclrhknlksrn 

agqqgqapslgqqqqilpkhktnekqeksek 

pqkrpltpfhhrvsvsddvgmd\ads\asqrl 

v\isap\dsq\vrfsnir\tndvak\tpqmhgte 

manspqppplsp\hpcdvvdegvtktpstpqs 

qhfyqmptpdplvpskpmedridslsqsfppq 

yqeaveptvyvgtavnleedeaniawkyyk 

fpkkkdveflppqlpsdkfkddpvgpfgqesv 

tsvtelmvqcackplkvsdelvqqyqiknqcl 

saiasdaeoepkidpyafvegdeeflfpdkkd 
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RQNSEREAGKKHKVEDGTSSVTVLSHEEDA 

MSLFSPSKQDAPRPTSHARPPSTSLIYDSDLA 

VSYTDLDNLFNSDEDELITGSKRSANGSDDK 

ASCKESKTGNLDPLSCISTADLHKMYPIWSL 

EQHIMGFSPMNMNl^YGSMDTTPGGTVLE 

GNSSSIGAQFKIEVDEGFCSPKPSEIKDFSYVY 

KPENCQILVGCSMFAPLKTLPSQYLPLIKLPEE 

CIYRQSWTVGKLELLSSGPSMPFIKEGDGSNM 

DQEYGTAYTPQTHTSCGMPPSSAPPSNSGAGI 

LPSPSTPRFPTPRTPRTTOTPRGAGGPASAQGS 

VKYENSDLYSPASTPSTCRPLNSVEPATVPSIP 

EAHSLYVNLILSESVMNLFKDCNSDSCX3CVC 

NMNIKGADVGVYIPDPTQEAQYRCTCGFSAV 

MNRKFGNNSGLFFEDELDnGR2m)CGKEAE 

KRFEALRATSAEHVNGGLKESEKLSDDLILLL 

QDQCTNLFSPFGAADQDPFPKSGVISNWVRV 

EERDCaiDCYLALEHGRQFNIDNMSGGKVDE 

ALVKSSCLHPWSKRNDVSMQCSQDILRMLLS 

LQPVLQDAIQKKRTVRPWGVQGPLTWQQFH 

KMAGRGSYGTDESPEPLPIPTFLLGYDYDYLV 

LSPFALPYWERLMLEPYGSQRDIAYWLCPE 

NEALLNOAKSFFRDLTAIYESCRLGQHRPVSR 

LLTDGIMRVOSTASKKLSEKLVAEWFSQAAD 

GNNEAFSKLKLYAQVCRYDLGPYLASLPLDS 

SLLSQPNLVAPTSQSLITPPQMTNTGNANTPS 

ATLASAASSTMTVTSGVAISTSVATANSTLTT 

ASTSSSSSSNLNSGVSSNKLPSFPPFGSMNSNA 

AGSMSTQANTVQSOQLGGQQTSALQTAGISG 

ESSSLPTQPHPDVSESTMDRDKVGIPTDGDSH 

AVTYPPATVVYIIDPFTYENTDESTNSSSVWTL 

GIXRCFLEMVQTLPPHIKSTVSVQIIPCQYIXQ 

PVKHEDREIYPQHLKSLAFSAFTQCRRPLPTS 

TNVKTLTGFGPGLAMETALRSPDRPECIRLYA 

PPFILAFVKDKQTELGETFGEAGQKYNVLFV 

OYCLSHDQRWILASCTDLYGELLETCIINIDVP 

NRARRKKSSAKKFGLQJUy Wli wci-ui- v v^iviisis 

LPWRWIGRLGRIGHGELKDWSCLLSRRNLQ 

SLSKRLKDMCRMCGISAADSPSILSACLVAM 

EPQGSFVIMPDSVSTGSVFGRSTTLNMQTSQL 

NTPQDTSCmiLVFPTSASVQVASATYTTENL 

DLAFNPNNDGADGMGIFDLLDTGDDLDPDn 

NILPASPTGSPVHSPGSHYPHGGDAGKGQSTD 

RLLSTEPHEEVPNILQQPLALGYFVSTAKAGP 

LPDWFWSACPQAQYQCPLFLKASLHLHVPSV 

QSDELLHSKHSHPLDSNQTSDVLRFVLEQYN 

ALSWLXroPATQDRRSCLPIHFVyLNQLYNFI 

MNML 


789 


2139 


A 


6359 


1 


2002 


■ "TGTLTEDGLDVMGVVPLKGQAFLPLVPEFKK 
LPVGPLLRALATCHALSRLQDTPVGDPMDLK 
MVESTGWVLEEEPAADSAFGTQVLAVMRPP 
LWEPQLQAMEEPPVPVSVLHRFPFSSALQRM 
SVWAWPGATQPEAYVKGSPELVAGLCNPET 
VPTDFAQMLQSYTAAGYRWALASKPLPSVP 
SLEAAQQLTRPTVEGDLSLLGLLVMRNLLKP 
QTTPVIQALRRTRIRAVMVTGDNLQTAVTVA 
RGCGMVAPQEHLnVHATHPERGQPASLEFLP 
MESPTAVNGVKDPDQAASYTVEPDPRSRHLA 
LSGPTFGIIVKHFPKLLPKVLVQGTVFARMAP 
EQKTELVCELQKLQYCVGMCGDGANDCXjAL 
KAADVGISLSQAEASWSPFTSSMASIECVPM 
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in 
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Amino acid sequence (A=Alanme 0=Cystcine. 
D=AspartJc Acid, EF=Glutanuc Acid, 
F=Phenylalanine, G=<jlycine» KNHistidine, 
Msoleucine, K'^Lysine, L^Leucin^ 
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Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tiyptophan, 
YtaTvrosine. X'^UnknowiL *^Stop codon, 
/=possible nucleotide ddetioD. V=possible 
nucleotide insertion 














VSEGRCSLDISI.'SVFKYMALYSLTQFISVLU- 

YTimmGDIXJFLAIDLVnTTVAVLMSRTGP 

ALVLGRVRPPGALLSVPVLSSLLLQMVLVTG 

VQLGGmXLAQPWFVPLNRTVAAPDNLPNY 

ENTVVFSLSSFQYLILAAAVSKOAPFRVRPLTN 

NVPFLLASAL*SSVLVVLVLSPGLLHGPLALR 

>nTDTGFKLLLVGLVTLNFVGGLHAGERARP 

VPPRLPAPPPAQAG\SKKRFKQLERELAEQPW 


790 


2140 


A 


6380 


76 


1059 


"SSAGSARKLQVMALAARLWKLLPFRRGAAi' 
GSRJLPAGTSGSRGHCGPCRFRGFEVMONPGT 
FKRGLLLSALSYLGFETYQVISQAAWHATA 
KVEEILEQADYLYESGETEKLYQLLTQYKESE 
DAELLWRLARASRDVAQLSRTSEEEKKLLVY 
EALEYAKRA/L/EKNESSFASHKWYAICLSDV 
GDYEGTKAKIANAYIIKEHFEKAIELNPKDATS 
IHLMGIWCYTFAEMPWyQRRIA*NACLQLFP 
»FPPYEKALG\YFHRAEQVDPNFYSKNLLLLG 
KmKLHNKKLAAFWLMKAKDYPAHTEED 

KpiOTEAAOLLTSFSEKN 


791 


2141 


A 


6434 


3 


1460 


1ALLIVDGLAWDDQGGLALLHISPSKLIL*QDS 

SGMS/yVMVRCTITRAFFKSLLCHICQYSIGPQ 

♦\mCPGQDACKE*KSTAN'GG*RE«PQVLFF 

AFLSNPAVKFGRMSKKQRDSLYAEVQKHQQ 

RLOEORQQQSGEAEALAK V i o& oian ui-oi^ 

KETSGTYANGSVIDLPKSEGYYNWSGQPSP 

DQSGLDMT\QIKQIKQEPIYDLTSVPNLFTY\SS 

FmGQLAPGIT\MTEIDRIAQNnKSHLETCQY 

TMEELHQLAWQTHTYEEIKAYQSKSREALW 

OQCAIQrrHAlQYWEFAKRITGFMELCQNDQ 

ILLLKSGGLEVVLVRMCRAFNPLNNTVLFEG 

KYGGMQMFKALGSDDLVNEAFDFAKNLCSL 

r\i Tcc-cTAT PQQAVr ISPDRAWLIEPRKVOKLQ 

EKIYFALQHVIQKlfflLDDETLAKLIAKIPTn'A 

VCNLHGEKLQVFKQSHPEIVNTLFPPLYKELF 

NPDCATACK 


792 


2142 


A 


6440 


92 


781 


SRGTFRCFCRDFFPCFSNMKLFLWNAVLTLl" Y 

TSLIGALIPEPEVKIEVLQKPFICHRKTKGGDL 

MLVHYEGYLBKDGSLFHSTHKHNNGQPIWFT 

LGE-EALKGWGPGA*K/DMCVGEKRKLnPPA 

LGYGKEGKGKIPPESTLIFNIDLLEIRNGPRSH 

ESFQEMDLNDDWKLSKDEVKAYLKKEFEKH 

GAWNESHHDALVEDIFDKEDEDKDGFISAR 

EFTYJOIDEL 


793 


2143 


A 


6446 


3201 


152 


■ T»iaJaRLVVTEEDGGARPEALGKlAPRTPAliLO 
ARADQELVTALMCDLRRPAAGGMMDLAYV 
OSWEKWSKSTHCPSVPLACAWSCRNLIAFTM 
DUlSDDQDLTWVflHnjyrEHPWDLHSIPSEHH 
EAm^LEWDQSGFPOFLFSRWPTGQlK\CWS 
MGVSTLAVNSWESSSVGSL\VEGGPHLWALb\ 
WIJHW4GVKLALHVEKSGASSFGEKFSR\VKFS 
P\SLTLF\OGNAMEGWIAVTVSGLVTVSLLQ\P 
S0QVL\TS1\ESLCRLRARVALADIAFTGGGNI 
WATADGSSANSPVQFYKVCVSWSEKCRIDT 
DILPSIJFMRCnTOLNRKDKFPAimKFL^ 
MSEQVLLCASSQTSSIVBCWSLRKEGLPVNNI 
FQQISPWGDKQPTILKWRJLSATNDLDRVSA 
VUU>KIJISLTNTOLKVASDTQFYPGU}LAL 
AFHDGSVHIVHRLSLQTMAVFYSSAAPRPVD 
EPAMKRPRTAGPA\raJCAMOI^WTSLALVG 
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Amino acid sequence (A«=Aianinc OCysteino, 
I>:Aspartic Acid, E=Glutamic Acid, 
F«Phenylalanine, G^Glycincj, H^Histidine, 
l=Isoleucine, K^Lysine, I^Lcucinc, 
M=Metiiionine, N=Asparagine» P=PioUne, 
Q=Glutamine, R=Arginine, S^Scrinc, 
T=Threonine, Vs'Valine, W^Ttyptophan, 
Y=Tyrosine, X=Unknown, ♦=Stop codon, 
/^possible nucleotide deletion, \ppossiblc 
nucleotide insertion 














"lDSHGKLSVU.RLSPSMGHPLEVGLALRHJULtL 
LEYCMVTGYDWWDILLHVQPSMVQSLVEKL 
HEEYTRQTAALQQVLSTRILAMKASLCKLSP 
CTVTRVCDYHTKLFLIAISSTLKSLLRPHFLNT 
PDKSPGDRLTEIGTKTTDVDIDKVMINLKTEEF 

QPCPTSEPCPTSEPSPTSEPSPTSEPSSP*SLaG 

CT 1 Tjur-UCTTT PTVlTqi GMLRELMWIRIWGLL 

KPSCLPVYTATSDTQDSMSLLFRLLTKLWICC 

RDEGPASEPDEALVDECCLLPSQLLIFSLDWL 

PASDGLVSRLQPKQPLRLQFGRAPTLPOSAAT 

LQLDGLARAPGQPKTOHLRRLHLOACaPTEEC 

KACI^CGCVTMLKSPNRTTAVKQWEQRWDC 

NCVLVRWALVAGAPQLPLSPAAPQLLLSVPSA 

APEPGCCK-oriKor W i JUJL*w^ v inliOX' r\i«xvn. ▼ viw 

RGPDACVTSRASEEAPAFVQLGPQSTHHSPRT 
PRSLDHLHPEDRP 


794 


2144 


A 


6490 


418 


585 


NGDKADLENESOlAQVLMPVVPALWEAbOU 
n55rEPRDLRLO*AVITPL\TPAWVTQ 


795 


2145 


A 


6499 


395 


1027 


KLLWLPPHSEQKRSPLYHPQGPSGrrPSAi\l-S 

SHSPPPSLLQANPSIAAFLRTHGHISASOPLRMP 

FPHm*NAFLLVFPGQRSQLTS/PSHYLayBVFP 

DHHHHLCRLaESSPLFHHRVLFCVPKQNVN 

STRAQIFCLFVHIVGCaiCINTFPLHLFRLHLWL 

T-rriT r\mi owTwTVoxfVT rsifTWirROPOSAAGS 

HFLQIPLClUN^ivaVJvJLAjJvl v vurR.vjoy«3xvrwvju 

DTRVRAAVGAPGLPVEPLV 




2146 


A 


6503 


68 


936 


HSALLTHSSFCVFTLCQDm'YSSMSEEVTYA 
DLQFQNSSEMEKIPEIOKFGEKAPPAPSHVWR 
PAALFLTLLCLLLLlGLGVlJ^SMFHVTLKIEM 
KKMNKLQNISEELQRNISLQLMSNMNISNKIR 
NLSrrTLQmTKLCRELYSKEQEHKCKPCPRR 
WIWHKDSCYFLSDDVQTWQESKMACAAQN 
ASLLKINNKNALEFIKSQSRSYDYWLGLSPEE 
DS/YSWYESG*YNQ\PSAWVIRNAPDLNNMy 
CGYINRLYVQYYHCTYKQRMICEKMANPVQ 

irySTYFREA 


797 


2147 


A 


6507 


1 


881 


- pGSrriASARSQVPRSAGEAAPHSRRPPGLLFH 
APRAASAQLEERMRDPHPG^TTLQEGDCRGS 
OTVSLTMGTADSDEMAPEAPQHTHIDVHIHQ 
ESALAKLLLTCCSALRPRATQARGSSRLLVAS 
WVMQIVLGILSAVLGGFFYIRDYTLLVTSGA 
AIWTGAVAVLAGAAAFIYEKRGGTYWALLR 
TLLALAAFSTAIAALKLWNEDFRYGYSYYNS 
ACRISSSSDWNTPAPTQSPEEVRRLHLCTSFM 
nvn VAl vrm OAMLLGVWHJLLLASLTPLWL 
/SL/RGECSQPKG*VPKKRDQKEMLEVSGI*PG 
STHASARSQVPRSAGEAAPHSRRPPGLLPHAP 
RAASAQLEERMRDPHPGMTLQEGDCRGSQT 
VSLTMGTADSDEMAPEAPQHTHIDVHlHQhb 
ALAKLLLTCCSALRPRATQARGSSRLLVASW 
VMQIVLQILSAVLGGFFYIRDYTLLVTSGAAI 
WTGAVAVLAGAAAFIYEKRGGTYWALLRTL 
LALAAPSTAIAALKLWNEDFRYGYSYYNSAC 
RISSSSDWNTPAPTQSPEEVRRLHLCTSFMDM 
LKALFRTLQAMLLGVWILLLLASLTPLWLYC 
WRMFPTKGVSP 


798 


2148 


A 


6528 


912 


2287 


■ ■ vPNYLPSVSSAIGGEVPQRYVWRFClCrUlSAr 
Wl^VAFAYWNHYLSCTSPCSCYRPLCRLNFG 
UJVVENUOLVLTYVSSSEDF/TWVPG^GRSG 
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Amino acid sequence (A=Aianlne OCysteine, 
I>Aspartic Acid, &=Glutaniic Acid, 
F=Phenylalanine, G=Glycine. H=Histidine. 
I=Isoleucine, K^Lysine, L^Leucine, 
M=Methioninc,N=Asparagine, P^Prolinc, 
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T«Threonine, V=VaIine, W«=TiyptQphan, 
Y=Tyrosine, X^Unknown, *=Stop codon, 
A=possibIe nucleotide deletion, possible 
nucleotide insertion 

T-\rcT>x2nT'ni DT PW^FiT PT^WnGHSLOCGSOSS 














FPPAIHENAFIVFIASSLGHMLLTCILWRLTKK 

HrVSQE\DGLSLAGAPRQPRRKSRTSVLRIRV 

MVRWELSSNGNPGROVLGLGLGLGNKLRW 

GQNLGL*HCVWVVWETGE*KRWRLQMGIE* 

GVASRRQ*VRNSVRGLVCHNSSAPPMYMGFF 

SPTVFGGGVGG*LHVTFILHPPEVEAAGPLLL 

GPSLPQRQGREHIVVILAAPACAPFHDR*WEP 

REIRPSP*ELGLRGEFaSYPASCRVlRQPIP*D 

RKSYSWKQRLFIINFISFFSALAYYFRHNMYC 

EAG VYTIFAlLEYTVv L 1 NMArnM i a w w ljp 

GNKELLITSOPEEKRF 


799 


2149 


A 


6529 


1 


874 


FFFFQRINFIEHSGSVSLLALACDLGWCbUWy 

CCLVQGGGDLVDWQTNHGEDEAGGDTDSV 

DEARCKESQQEAQENLREDLCLESFAKDKIL 

QIIEGSEREHEETRTKQAALDGEPLGGGQLTA 

VHLHPSKEQQGQEGGERQRGARTHHWRGW 

EKGRRVRLRPPSGKLRADvJrVKRXVAjr iro/ 1 

ELPGLQPHAPTPHTA/PATPrySPAPDTPNPPV 

RWKCPLPVEPRTRQLCRERTRKACPPKPRPPL 

GLPODPTGPVTHHAPPVSPTQASGQERRAEP 

GAVSYAHASATK 


800 


2150 


A 


6544 


2 


662 


SAQRWAAVAGRWGCRLLA1XLLVPGPUUA5 

EITFELPDNAKQCFYEDIAQGTKCTLEFQVTTG 

GHYDVDCRLEDPDGKVLYKEMKKQYDSFTF 

TASKNGTYKFCFSNENFSTFniKTVYFDFQVO 

EVTHLCFLVR/DRVSALTQMESACVSIHEALKS 

VIDYQTHFRLREAQGRSRAEDLNTRVAYWSV 

GEALILLVVSIGOVFLLKSFFSDKRTnTRVGS 


801 


2151 


A 


6556 


1 


1319 


"tPCMECIKGEGLREPQNLSGSQREPQTiiliSM 

DGWRRMPRWGLLLLLWGSCTFOLPTUTi If 

KRIFUCRMPSIRESLKEROVDMAKLOris w av^r 

MKRLTLGNTTSSVILTNYMDTQYYGEIGIGTP 

PQTFKWFDTGSSNVWVPSSKCSRLYTACVY 

HKLFDASDSSSYKHNGTELTLRYSTGTVSGFL 

SQDIITVGGnvrQMFGEVTEMPALPFMLAEF 

DGVVGMGFDBQAIOlVTPIFDNnSQGVLKED 

VFSFYYNRDSENSQSLOGQIVLGGSDPQHYE 

GNFHYINLIKTGVWQIQMKGVSVGSSTIXCE 

DGCLALVDTGASYISGSTSSIEKLMEALGAKE 

KRIJDYVVKCNEGPTLPPTFLFLLGGKDIPLT 

SADYLFQESYSSKKI^TLAIHAMYIPPPTXJPTL 

NALGATPORKFYTEFDRGNNPHGFALAR 


802 


2152 


A 


6567 


13 


6147 


■ MCLGRMGASSPRSPEPVGPPAPGLPFCCOtibL 
LAVVVLLAIJVAWGQCWAPEWUJTARPTNL 
TDEFEFPIGTYLNYECRPGYSGRPFSnCLKNS 
VWTGATORCRRKSCRNPPDPVNGMVHVIKG 
lOFGSQKYSCTKGYRLIGSSSATCnSGDTVIW 
DNETPICDRIPCGLPPTITNODFISTNRENFHY 
GSWTYRCNPGSGGRKVFELVGEPSIYCTSND 
DQVGIWSGPAPQCnPNKCTPPNVENGE-VSD 
NRSLFSLNEWEFROQPGFVMKGPRRVKCQA 
LNKWEPELPSCSRVCQPPPDVLHAERTQRDK 
DNFSPGQEVFYSCEPGYDLRGAASMRCTPQG 
DWSPAAPTCEVKSCDDFMGQLLNGRVLFPV 
NLQLGAKVDFVCDEGFQLKGSSASYCVLAG 
MESLWNSSVPVCEQIFCPSPPVIPNGRHrGKP 
LEVFPFGKAVNYTCDPHPDRGTSFDLIGECTIR 
CTSDPQGNGVWSSPAPRCOaGHCQAPDHFL 
FAKLKTQTNASDFPIGTSLKYECRPEYYGRPF 
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Amino acid sequence (A=Alanine OCysteine, 
D=Aspartic Acid, EFGlutamic Add, 
F=Phenylaiaiijne, G=Qlycinc, »=»Histjdine, 
Msoleucinc, K=Lysinc, LFLcucine, 
M^Methionine, N^Asparagine, P«Pioline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y«Tyit)Sine, X«Unknown, *=»Stop codon, 
/==possible nucleotide deletion, V=possible 
nucleotide insertion 














■ SrrCLDNLVWSSFKDVCKRKSCKTPPDPVNG 

MVHVTTDIQVGSRINYSCTTGHRLIGHSSAECI • 

LSGNAAHWSTKPPICQRIPCGLPPTIANGDFIS 

TNRENFHYGSWTYRCNPGSOGRKVFELVGE 

PSIYCTSNDDQVGIWSGPAPQCllPNKCTPPNV 

ENGDLVSDNRSLFSLNEWEFRCQPGFVMKGP 

RRVKCQALNKWEPELPSCSRVCQPPPDVLHA 

ERTQRDKDNFSPGQEVFYSCEPGYDLRGAAS 

MRCTPQGDWSPAAPTCEVKSCDDFMGQLLN 

GRVLFFVNLQLGAKVDFVCDEGFQLKOSSAS 

YCVLAGMESLWNSSVPVCEQIFCPSPPVIPNG 

RHTGKPLEVFPFGKAVNYTCDPHPDRGTSFD 

LIGESTIRCTSDPQGNGVWSSPAPRCGILGHC 

OAPDHFLFAKLKTQTNASDFPIGTSLKYBCRP 

•EYYGRPFSrrCLDNLVWSSPKDVCKRKSCKTP 

PDPVNGMVHVTTDIQVGSRINYSCTTGHRLIG 

HSSAECILSGNTAHWSTKPPICQRPCGLPPn 

ANGDnSTNRENFHYGSWTYRCNLGSRGRK 

VFELVGEPSIYCTSNDDQVGIWSGPAPQCnPN 

KCTPPNVENGILVSDNRSLFSLNEWEFRCQP 

GFVMKGPRRVKCQALNKWEPELPSCSRVCQ 

PPPEILHGEHTPSHQDNFSPGQEVFYSCEPGY 

DLRGAASLHCTPQGDWSPEAPRCAVKSCDDF 

LGQLPHGRVLFPLNLQLGAKVSFVCDEGFRL 

KGSSVSHCVLVGMRSLWNNSVPVCEHIFCPN 

PPAILNGRHTOTPSGDIPYGKEISyTCDPHPDR 

GMTPNUGESmcrSDPHGNGVWSSPAPRCE 

LSVRAGHCKTPEQFPFASPTIPINDFEFPVGTS 

LNYECaa»GYFGKMFSISCLE^OLVWSSVEDNC 

RRKSCXjPPPEPFNGMVHINTDTQFGSTVNYSC 

T^GFRUGSPSTT<XVSGNKSnrWDKKAPICEn 

SCEPPPnSNGDFYSNNRTSFHNGTWTYQCH 

TGPDGEQLFELVGERSIYCTSKDDQVGVWSS 

PPPRCISTNKCTAPEVENAIRVPGNRSFFSLTEI 

IRFRCQPGFVMVGSHTVQCQTNGRWOPKLPH 

CSRVCXJPPPEDLHGEHTLSHQDNFSPGQEVFY 

SCEPSYDLRGAASLHCii%^uworiinrxs.v^i v 

KSCDDFLGQLPHGRVLLPLNLQLGAKVSFVC 

DBGFRLKGRSASHCVLAGMKALWNSSVPVC 

EQIFCFNPPAILNGRHTGTPLGDIPYGKEVSYT 

CDPHPDRGMTFNLIGESTIRRTSEPHGNGVWS 

SPAPRCELPVGAACPHPPKIQNGHYIGGHVSL 

YLPGMTISYTCDPGYLLVGKGFIFCTDQGIWS 

OUJHYCKEVNCSFPLFMNGISKELEMKKVYH 

YGDYVTLKCEDGYTLEGSPWSQCQADDRWD 

PPl^CTSRTHDALIVGTLSGTIFFILLnFLSWI 

ILKHRKGNNAHENPKEVAIHLHSQGGSSVHP 

RTLOTNEENSRVLP 


803 


2153 


A 




2 


3233 


■ HGRSARLAAVPAEAMPGPRRPAGSRLKLLLL 
LLLPPJuLLLLRGVSHAGNLTVAWLPLANTSY 
PWSWAVRVGPAVELALAQVKARPDLLPGWT 
VRTVLGSSENALGVCSDTAAPLAAVDLKWE 
HNPAVFLGPGCVYAAAPVGRFTAHWRVPLL 
TAGAPALGFGVKDEYALTTRAGPSYAKLGDF 
VAALHRRLGWERQALMLYAYRPGDEEHCFF 
LVEGLFMRVRDRLNITVDHLEFAEDDLSHYT 
RLLRTMPRKGRVIYICSSPDAFRTLMLLALEA 
GLCGEDYVFFHLDIFGQSLQGGQGPAPRRPW 
ERGDGQDVSARQAFQAAKIITYKDPDNPEYL 
EFUCQLKinAYEQFNFTMEDOLVKnPASFH 



246 




PCT/USOl/03800 

WO 01/57188 



SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 
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Amino acid sequence (A=Alanine OCysteine, 
I^Aspartic Acid, E«Ohitamic Acid, 
F-Phenylalaninc, 0=<jlycinc, H=Histidine, 
Msoleucine, K^Lysine, L?=Leucine, 
M=Methionine. N^Asparagine, P=Proline, 
Q=Glutamine, R=Arginme, S=Serine, 
T^Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, ♦«Stop codon, 
/«=possible nucleotide deletion. \Fpossiblc 
nucleotide insotion 














DGLIXYIQAVTETLAHGGTVTDGENITQRMW 

NRSFQGVTGYLODSSGDRETDFSLWDMDPE 

NGAFRWLNYNGtSQELVAVSGRKLNWPLG 

YPPPDIPKCGFDNEDPACNQDHLSTLEVLALV 

GSLSLLOILIVSFFmKMQLEKELASELWRVR 

WEDVEPSSLERHLRSAGSRLTLSGRGSNYGSL 

LTTEGQFQVFAKTAYYKGNLVAVKRVNRKR 

ffiLTRKVLFELKHMRDVQNEHLTRFVGACTD 

PPNICILTEYCPRGSLQDILENESITLDWMFRy 

SLTNDIVKGMLFLHNGAICSHGNLKSSNCVV 

DGRFVLKTTDYGLESFRDLDPEQGHTVYAKK 

LWTAPELLRMASPPVRGSQAGDVYSFGULQE 

lALRSGVFHVEGLDLSPKElIERVTRGEQPPFR 

PSLALQSHLEELGLLMQRCWAEDPQERPPFQ 

QIRLTLRKFNRENSSNILDNLLSRMEQYANNL 

EELVEERTQAYLEEKRKAEALLYQILPHSVAE 

QLKRGETVQAEAFDSVnYFSDIVOFTALSAE 

STPMQVVTLLNDLYTCFDAVIDNFDVYKVET 

IGDAYMWSGLPVRNGRLHACEVAR^4ALAL 

LDAVRSFRIRHRPQEQLRLRIGIHTGPVCAGV 

VGLKMPRYCLFGDTVNTASRMESNGEAL\KI 

HLSS\ETKAVL\EEFGOFELELRGDVEMKOKG 

KVRTYWLLGERGSSTRO 


804 


2134 


A 


6585 


2 


3837 


DAPGRPPVRLPTMELEDGWYQEEPGGSUAV 

MSERVSGLAGSIYREFERLIVRYDEEWKELIP 

LWAVLENLDSVFAQDQEHQVELELLRDDNE 

QLrrQYEREKALRKHAEEKFDSFEDSQEQEKK 

DLQTKVESLESQTOQLELKAKNYADQISILEE 

REAELKKEYNALHQRHTEMIHNYMEHLERT 

KLHQLSGSDQLESTAHSRIRKERPISLGIFPLP 

AGDGLLTPDAQKGGETPGSEQWKFQELSQPR 

SHTSLKDELSDVSQGGSKATITASTANSDVA 

TIPTDTTLKEENEGFVKVTDAPNKSEISKHIEV 

QVAQETRNVSTGSAENBEKSEVQAIIESTPEL 

DMDKDLSGYKOSSTnKGIENKAFDRNTESL 

FEELSSAGSGLIGDVDEGADLLGMGREVENLI 

LENTQLIJ2TKNALNIVKNDLIAKVDELTCEK 

DVLQGELEAVKQAKLKLEEKNRELEEELRKA 

RAEAEDARQKAK0DDDSDIPTAQRKRFTRVE 

MARVLMERNQYKERLMELQEAVRWTEMIR 

ASRENPAMQEKKRSSIWQFFSRLFSSSSNTTK 

KPEPPVNLKYNAPTSHVTPSVKXRSSTLSQLP 

GDKSKAFDFLSEETEASLASRREQKREQYRQ 

VKAHVQKEDGRVQAFGWSLPQKYKQVTNG 

QGENKMKNLPVPVYLRPLDEKDTSMKLWCA 

VGVNLSOQKTRDGGSWOASVFYKDVAGLD 

TCGSKQRSASQSSLDKLDQELKEQQKELKNQ 

EEUSLVWICTSTHSATKVUIDAVQPGNILDS 

FTVCNSHVLOASVPGARETDYPAGEDLSESG 

QVDKASLCOSMTSNSSAETDSLLGGITVVGC 

SAEGVTGAATSPSTNGASPVMDKPPEMEAEN 

SEVDENVPTABE\ATEATEGNAGSAEDTV\DIS 

QTGVYTEHVFTDPLGWQIPEDLSPVYQSSND 

SDAYKDQISVLPNEQDLVREEAQKMSSLLPT 

MWLGAQNGCLYVHSSVAQWRKCLHSIKLKD 

SILSIVHVKGIVLVALADGTLAIFHRGVDGQW 

DLSNYHLU)LGRPHHSIRCMTVVHDKVWCO 

YRNKIYWQFKAMiaEKSFDAHPRKESQVRQ 

LAWVGDGVWVSIRLDSTLRLYHAHTYQHLQ 

DVDrEPYVSKMLGTGKLGFSFVRTTALMVSC 
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nucleotide 
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peptide 
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to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A»Alanine OCystem^ 
I>=Aspartic Acid, EF^Glutamic Acid, ^ 
F=PhenyIalanine, G=01ycine, H^Histidine, 
l=Isoleucine, K=Lysine, LpLeucinc, 
M^Methionine, N=Aspaiagine, P=Proline, 
Q=<jlutamine, R=Arginine, S=Serme, 
T^Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosinc, X^Unknown, *«Stop codon, 
/^possible nucleotide deletion, )rT)ossible 
nucleotide insertion 














NRLWGTONGVnsrPLTElVILHQGRLLULR 
ANKTSGVPGNRPGSVIRVYGDENSDKVTPGT 
FIPYCSMAHAQLCFHGHRDAVKPPVAVPGQV 
ISPOSSSSOTDLTODKGRGHLHRSLWRRP 


805 


2155 


A 


6605 


469 


2602 


FGRLLWGTAFKSWKMKAFIPHLILLY Al 1* 1 g 

SLKVVTKRGSADGCTDWSIDIKKYQVLVGEP 

VRIKCALFYGYIRTNYSLAQSAGLSLMWYKS 

SGPGDFEEPL4JT)GSBMSKEEDS1WFRPTLLQ 

DSGLYACVERNSTYCMKVSISLTVGENDTGL 

CYNSKMKYFEKAELSKSKEISCRDIEDFLLPT 

REPEn.VmJEaiTKTWRPSIVFKRDTLLIREV 

REDDIGNYTCELKYGGFWRRTTELTVTAPL 

TDKPPKLLYPMESKLTIQETQLGDSANLTCRA 

FFGYSGDVSPLnrWMKGEKFIEDLDENRVWE 

SDIVKILKEHLGEQEVSISLIVDSVEEGDLGNYS 

CYVENGNGRWIASVIXHKRElMYTVBLAuo 

LGAILLLLVCLVTIYKCYKIEIMLFYKNHFGA 

EELDGDNKDYDAYLSYTKVDPDQWNQETGE 

EERFALEILPDMLEKHYGYKLFBPDRDLIFTGT 

YDEDVARCVDQSKRLnVMTPNYWRRGWSIF 

ELETRLRNMLVTGEIKVILIECSELRGIMNYQE 

VEALKHTIKLLTVIKWHGPKCNKLNSKFWKK. 

LQYEMPFKRIEPITHEQALDVSEQGPFGELQT 

VSAISMAAATSTALATAHPDLRSTTHNTVHS 

QMRQKHYYRSYEYDVPPTGTLPLTSIGNQHT 

YCNIPMTLINGQRPQTKSSREQNPDEAHTNSA 

ILPLLPRETSISSVIW 


806 


2156 


A 


6614 


3 


1584 


NSARGGVGVRGARA^4A'lYQEKAAALNL^SAL 

HSPAHRPPGFSVAQKPFGATYVWSSIINTLQrr 

QVEVKKRRHRUOOiNDCFVGSEAVDVlFSHL 

IQNKYFGDVDIPRAKWRVOQALMDYKVFE 

AVPTKVFGKDKKITra>SSCSLYRnTIPNQD 

SQLGKENKLYSPARYADALr KJ»JUiKSAaLJiiJ 

LWENLSLKPANSPHVNISTTLSPQVINBVWQE 

ETIGRLLQLVDLPLLDSLLKQQEAVPKIPQPK 

RQSTMVNSSNYLDRGILKAYSDSQEDEWLSA 

AIDCLEYLPDQMWEISRSFPEQPDRTDLVKE 

LLFDAIGRYYSSREPLLNHLSDVHNGIAELLV 

NGKTEIALEATQLLLKLLDFQNREEFIOILLYF 

MAVAANPSEFIOXJKBSDNRMVVKRIFSKAIV 

DNKNLSKGKTDLLVLFL\MDHQKDVFKIPGT 

LVHKIVSWKILMAIQNGRDPNRDAGYIYCQRI 

DQRDYSNrrEKTnDELLYLLKTLDEDSKLSA 

KEKKKU-LGOFYKCHPDIFIEHFGD 


807 


2137 


A 


6615 


4198 


■"2094 ■ 


' 'FGIVGTFALETDELUSDRDPAIFSLUUl'UAMK 
PQILLXLALLTLGIJVAQHQDKVPCKMA^KML 
CPDRVDKKVSCQVLGLLQVPSVLPPDIETLD 
LSGNQLRSEASPLGFYTALRHLDLSTNEISFL 
QPGAFQALTHLEHLSLAHNRLAMATALSAG 
GLGPLPRVTSLDLSGNSLYSOLLERLLOEAPS 
LmXSLAENSLTRLTRHTFRDMPAIJEQLDLHS 
NVLMDIEDGAFEGLPRLTHLNLSRNSLTaSD 
FSLQQLRVLDLSCNSIEAFQTAS\QPQAEFQLT 
WLDLRENKUJffPDlAAIi»RLIYLNLSNNLIR 
LPTGPPQDSKGIHAFSEGWSALPLSVVPSGNAS 
GRPLSQLLNLDLSYNEEELIPDSFLEHLTSLCFL 
NLSRNCLRTFEARRLGSLPaMLLDLSHNALE 
TLELGARALGNSLRTUXQGNALRDLPPYTFA 
NLASLQRLNLQGNRVSPCGGPDEPGPVSGCVX 
AFSGITSLRSLSLVDNEIELLRAGAFLHTPLTE 
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nucleotide insertion 














"LDLSSNPGLEVATGALGOLEASLEVLALgow 

GLMVLQ\aDLPCnCLKRLNLAENRLSHLPAW 

TQAVSLEVLDLRNNSFSLLPGSAMGGLETSLR 

RLYLQGNPLSCCGNGWLAAQLHQGRVDVDA 

TQDLICRFSSQEEVSLSHVRPEDCEKGOLKNI 

l^niLTFILVSAlLLTTLAACCCVRRQKFNQQ 

YKA 


SOS 


21S8 


A 


6619 


153 


1852 


FKALSQYIYTNTHLEREAAFfiVAILLRRMEEO 

AimRNNTEKKHPGGGESDASPEAGSGGGGV 

ALKKEIGLVSACGIIVGNnGSOIFVSPKGVLEN 

AGSVGLALIVWIVTGFlTWOALCyAELGVNI 

PKSGGDYFYVKDIFGGLAGFLRLWIAVLVrYP 

TNQAVIALTFSNYVLQPLFPTCFPPESGLRLLA 

AT/-i» Til •nin/XTr'OCVTJWATDVOriTFTAGKL.L 
AICLLLLl W VNCoo V K W A 1 1\ V V^L^JT 1 /\viJ^xr*rf 

ALALniMGIVQICKGEYFWLEPKNAITOJFQEP 

DIGLVALAFLQGSFAYGGWNFLNVNVTEELV 

DP\YKNL\PRAMSIP\LVTFVYVFANV/ALYVT 

AMSPQEL\LASVNAVAVTFGEKLLGVMAWIM 

PISVALSTFGGVNGSLFTSSRLFFAGAREGHLP 

SVLAKflHVKRCTPIPALLFTCISTLLMLVTSD 

MYTLIhm^GFINYLFYGVTVAOQIVLRWKKP 

DIPRPIKINLLFPIIYLLFWAFLLVFSLWSEPW 

ELLTLVSQKMCVWYPEVERGSGIEEANED 
MEEOOOPMYQPTPTKDKDVAGQPQP 


809 


2159 


A 


6621 


1041 


223 


QDSRKMLPSTSVNSLVQ^MUVLNSRDAAKH 

TAGAKRYKYLRRLFRFRQMDFEFAAWQMLY 

LFTSPQRVYRNFHYRKQTODQWARDDPAFL 

VLLSIWLCVSTIOTGFVLDMGFFETIKLLLWV 

VLmCVGVGLLlATLMWFISNKYLVKRQSRD 

vrkVuu/nvAPnvHl NAFYPLLVILHFIOLFETN 

HVE-TDIHGYLVGNTLWLVAVGYYTYVTFL 

GYSVGU.FFS\ALPFLKNTVILLYPFAPLILLYG 

LSLALGWNFTHTLCSFYKYRVK 


810 


2160 


A 


6623 


160 


822 


"SPASGHCRLNGAAVAMFGCLVAGRLVQrrAA 
QQVAEDKFVFDLPDYESINHVVVFMLGTIPFP 
EGMGGSVYFSYPDSNGMPVWQLLGFVTNGK 
PSAIFKISGLKSGEGSQHPFGAMNIVRTPSVAQ 
IGISVELLDSMAQQTPVGNAAVSSVDSFTQFT 
QKMLDNFYNFASSFAVSQATPDDTQ/RPSKvff 
IPANVVLKWYENFQRRTSTEPSLLENIIWIKIN 

F 


sn 


2161 


A 


6627 


18 


3367 


UBGSLm'ERAKYYLTrrMPHl'-rVrKVEDPEEG 
AAASISQEPSLADIKARIQDSDEPDLSQNSrrG 
EHSQLLDDGHKKARNAYLNNSNYEEGDEYF 
' DKNIJVLFEEEMDTRPKVSSLLKRMAbrmiLT 
QGAKEHEEAENITEGKXKPTKTPQMGTFMG 
VYLPCLQNIFGVILFLRLTWWGTAGVLQAF 
AIVLICCCCmLTAISMSAIATNGWPAGGSY 
FMISRALGPEFGG AVGLCFYLGTTFAAAMy IL 
OAIEIFLVYIVPRAAIFHSDDALKESAAMLNN 
MRVYGTAFLVLMVLVVFIGVRYVNKFASLFL 
ACnOVSILAIYAGAnCSSFAPPHFPVCMLGNRT 
LSSRHIDVCSKTKEINNMTVPSKLWGFFCNSS 
QFFNATCDEYFVHNNVTSIQGffGLASGIITEN 
LWSNYLPKGEIIEFCPSAKSSDVLGSLNHEYVL 
VDnTSFOLVGIFFPSVTOIMAGSNRSGDLKD 
AQKSmGTIIj\ILTrSFVYLSNVVLFGAClEGV 
VLRDKFGDAVKGNLWGTLSWPSPWVrVlGS 
FFSTCGAGLQSLTGAPRLLOAIAKDNIIPFLRV 
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F=Phenylalanine, CHjIycine, H=Histidine, 
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Y=Tyrosine, X==Unknown, *=Stop codon, 
/^possible nucleotide deletion, V=possible 
nucleotide insertion 














FGHSKANGEPTWALLLTAAIAELGILIASLDL 

VAPn^MmJ4CYLF\rNUCAU5TlXRTFNW 

RPRFRYYHWALSFMGMSICLALMFISSWYyA 

IVAMVIAGMIYKYIEYQGAEiaBWGDGIRGLS 

LSAARFAIXM.EEGPPHTIQWRPQLLVLLKL 

DEDUIVKHPRLLTFASQLKAOKOLTIVGSVIV 

r;MPr FNTYfiRALAAEOTIKHLMEAEKVKGFCQ 

LWAAKU^EGISHLIQSCGLGGMKHNTVVM 

GWPNGWRQSEDARAWKTFIGTVRVTTAAHL 

ALLVAKMSFFPSNVEQFSEGNIDVWWIVHDG 

GMLMaPFIXKVQHKVWRKCSIRFnTVAQLE 

DNSIQMKKDLATFLYHLRIEAEVBVVBMHDS 

DISAYTYERTLMMEQRSQMLRHMRLSKTER 

r\TiTj Am vifnPNSMLRLTSIGSDEDEETETY'O 

EKVHlvnWrKDKYMASRGQKAKSMEGFQDL 

LNMRPDQSNVRRMHTAVKLNEVIVNKSHEA 

KLmNMPGPPRNPEGDENYMEFLEVLTEGL 

RRVLLVRGOGSEVmYS 


812 


2162 


A 


6628 


66 


640 


AVCTMSEMAELSELYEESSDLQMDVMFUJEU 

DLPQN4EVGSGSRELSLRPSRSGAQQLEBEGP 

MEEEEAQPMAAPEGKRSLANGPNAGEQPGQ 

\/ A r» A TMTCCSPnPfJFFPDDWEDDYDYPEEEOLS 

GAGYRVSAALEEADKMFLRTREPALDGQFQ 

MHYEKTPFDQLAFIBELFNSLMVVNRLTEELG 

CDEEDDRE 


813 


2163 


A 


6630 


708 


1355 


akj^gaVkyiqIbLwrkkqsdvmrfllrvrcj 

wqyrqlsalhraprptrpdkarrlgykakq 

gy/vyiyigfvfaviyrirvrrggrkrpvpko 

atygkpvhhgvnqlkfarslqsvaeeragr 

hcgalrvlnsywvgedstykffevilidpfhk 

airrnpdtqwmcpvhkhremrgltsagrks 

rglgkghkfhhtiggsrraawrrrntlqlh 

RYR 


814 


2164 


A 


6635 


201 


1705 


'kgtemnksrwqsrrrhgrrshqqnpwfrlr 
dsedrsdsraaqpahdsghgddespstssgt 
agtssvpelpgfyfdpekkryfrllpghnncn 
pltkesirqkemeskrlrllqeedrrkkiarm 
gfnassmlrksqlgflnvtnychlahelrls 

CMERKKVQIRSMDPSALASDRFNLILADTNS 

drlfivndvtvgoskygiinlqslktptlkvf 

mhenlyfinrkvm^svcwaslnhldshillc 

lmglaetpgcatllpaslfvnshpagidrpg\ 

nolcsfripgawscawslniqanncfstglsr 

rvlltnwtghrqsfginsdvlaqqfalma 

pllfngcrsgeefaidlrcgnqgkgwkatrlf 

hdsavtsvrilqdeqylmasdmagkiklwd 

lrttkcvrqyeghvneyaylplhvheeegil 

vavgqdcytriwslhdarllrtipspypaska 

dipsvafssrlggsrgapgllmavgqdlycy 

SYS 


815 


2165 


A 


6643 


659 


3282 


■ NKNDLEVPSARTTKIMUDHLDLLLGVVLMAU 

pvfgipscsfdgriafyrfcnltqvpqvlntte 

rlllsfnyirtvtassfpfieqlqllelgsqyt 

pltidkeafrnlpnlrildlgsskiyflhpdaf 

qglfhlfelrlyfcglsdavlkdgyfrnlka 

ltrldlsknqirslylhpsfgklnslksidfss 

nqiflvceheleplqgktlsffslaanslysr 

vs\rdwgkcmnpfrnmvleildvsgngwtv 

ditgnfsnaisksqafslilahhimgagfgfhn 

ikdpdqnttaglarssvrhldlshgfvfslns 
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D=«Aspartic Acid, EKJlutamic Acid, 



F^Phenylalaninc, Q=Glycinc, H=Histidme, 
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M=Mettiiomne, N=Asparagine, P=Proline, 
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correspondi 
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/^possible nucleotide deletion, \Fpossible 
nucleotide insertion 



residue of 

peptide 

sequence 



' iVFETLKDLKVLNLAYNKJNKIADEAFYQjLU 
NLQVLNLSYNLLGELYSSNFYGLPKVAYIDL 
QKNmAIIQDQTFKFLEKLQTLDLRDNALrnH 
FIPSIPDIFLSGNKLVTLPKINLTANLIHLSENR 
LENLDILYFLLRVPHLQTLILNQNRFSSCSGDQ 
TPSENPSLEQLFLGENMLQLAWETELCWDVF 
EGLSHLQVLYLNHNYLNSLPPGVFSHLTALR 
GLSLNSNRLTVLSHNDLPANLEILDISRNQLL 
APNPDNO^SLSVLDrraNKFlCECELSTFINWL 
NHTNVmGPPADIYCVYPDSLSGVSLFSLSTE 
GCDEEEVlXSLKFSLFIVCTmTLFLMTILTV 
TKFRGFCnCYKTAQRLVFKDHPQGTEPDMY 
KYDAYLCFSSKDFTWVQNALLKHLDTQYSD 
ONRFNLCFEERDFVPGENIOWnQDAIWNSR 
KIVCLVSRHFLRDGWCLEAFSYAQGRCLSDL 
NSALIMWVGSLSQYQLMKHQSIRGFVQKQQ 
YLRWPEDLQDVGWFLHKLSQQILKKEKEKK 

KDNNIPL OTVATIS „ 

"RDRAGVRPAGKQHAAAAFYDVGGDRPWUS 
GNTQLPPRNPVKANAMFOAGDEDDTDFLSPS 
GGARLASLFGLDQAAAGHGNEFFQYTAPKQP 
KKGQGTAATGNQATPKTAPATMSTPTILVAT 
AVHAYRYTOGQYVKQGKFGAAVLGNHTTR 
EYRILLYISQQQPVTVARIHVNFELMVRPNNY 
STFYDDQRQNWSIMFESEKAAVEFNKQVCIA 
KCNSTSSLDAVLSQDLIVADGPAVEYGDSLE 
VAYTGWLFQNHVLGQVFDSTANKDKLLRLK 
LGSGKVIKGWEDGMLGMKKGGKRLLIVPPA 
CAVGSEGVIGWTQATDSILVFEVEVRRVKIA 
KDSGSDGHSVSSRDSAAPSPIPGADNLSADPV 
VSPPTSIPFKSGEPALRTKSNSLSEQLAINTSPD 
AVKAKLISRMAKMGQPMLPILFPQLDSNDSEI 
EDVNTLQGGGQPWTPSVQPSLQPAHPALPQ 
MTSQAPQPSVTOLQAPSAALMQVSSLDSHSA 
VSGNAQSFQPYAGMQAYAYPQASAVTSQLQ 
PVRPLYPAPLSQPPHFQGSGDMASFLMTEAR 
QHNTEIRMAVSKVADKMDHLMTKVEELQKH 
SAGNS\lLn>SMSV™ETSMIMSNIQRnQENER 
LKQEILEKSNRIEEQNDKISELIERNQRYVEQS 
NLMMEKRNNSLQTATENTQARVLHAEQEKA 
KVTEELAAATAQVSHLQLKNfTAHQKKETEL 
QMQLTESLKETDLLRGQLTKVQAKLSELQET 
SEQAQSKFKSEKQNRKQLELKVTSLEEELTDL 
RVEKESLEKNLSERKKKSAQERSQAEEEIDEI 
RKSYQEELDKLRQLLKKTRVSTDQAAAEQLS 
LVQAELQTQWEAKCEHLLASAKDEHLQQYQ 
EVCAQRDAYQQKLVQLQEKSVCFA\CLALQA 
QITALTKQNEQHIKELEKNKSQMSGVEAAAS 
DPSEKVKKIMNQVFQSLRRBFELEESYNGRTI 
LGTIMNTIKMVTLQLLNQQEQEKEESSSEEEE 
EKAEERPRRPSQEQSASASSOQPQAPLNRERP 
ESPMVPSEQWEEAVPLPPQALTTSQDGHRR 
KGDSEAEALSEDCDGSLPPELSCIPSHRVLGPP 
TSIPPEFLGPVSNDSECEESLAASPMAAKVPDN 
PSGKWCVRBVAPDGPUJESSTRLSLTSVDPEE 
GDPLALGPESPGEPQPPQLKKDDVTSSTGPHK 
ELSSTEAGSTVAGAALRPSHHSQRSSLSGDEE 
DELFKGATLKALRPKAQPEEBDEDEVSMKGR 

PPPTPL FGPDDDDDDIDWLG . . 

' TPRSSSDNGSPIRQYEmSTPAHQGFVMCiLbG 




251 



wo 01/57188 



PCTAJSOl/03800 



SEQ ID 
NO: of 
nucl- 
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peptide 
seq- 
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in 
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nucleotide 

location 
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ng to first 

amino acid 

residue of 

peptide 
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Predicted end 
nucleotide 
location 
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to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=AIamne OQrsteine, 
I>=Aspartic Acid, EH31utBmic Acid, 
F=PhcnylaIanine, GKjlycine, H«=Histid5ne, 
I=Isoleucinc, K=Lysme, L^Leucine, 
M=Metluomne, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Thrconine, V=Valine, W=Tiyptophan, 
Y=Tyrosine, X=Unknown, •=Stop codon, 
/=possiDlc nucieouQe ceieuoni v*pos5ioic 
nucleotide insertion 














KS/ARNSQLRIVLVGKTGAGBLSATGNSILGRK 

VFHSGTAAKSnXKCEKRSSSWKETELVWD 

TPGIFDTEVPNAETSKEIIRCILLTSPGPHALLL 

VWLGRYTEEEHKATEKILKMFGERAIISFMIL 

IFTRKDDLGDTNLHDYLREAPEDIQDLMDIFG 

DRYCALNNKATGAEQEAQRAQLLGLIQRW 

RENKEGCYTNRMYQRAEEEIQKQTQAMQEL 

HRVELEREKARIREEYEEKIRKLEDKVEQEKR 

KKQMEKKLAEQEAHYAVRQQRARTEVESKD 

GILEUMTALOIASFILLRLFAED 


81S 


2168 


A 


6660 


357 


1890 


APSGSWTRVVLTLDPCSLRSRSPRSLLDPGMP 

GISARGLSHEGRKQLAVNLTRVLALYRSDLDA 

YIIEFRTDNLWDTLPCSWQEALDGLKPPQLA 

TMLLGMPGEGEWRYRSVWPLTLLALKSTA 

CALAFTRMPGFQTPSEFLENPSQSSRLTAPFR 

KHVRPKKQHEIRRLGELVKKLSDFT/GLHPGC 

RRGLRPGNHLSRFMALGLGLMVKSIEGDQRL 

VERAQRLDQELLQALEKEEKRNPQWQTSPR 

HSPHHWRWVDPTALCEELLLPLENPCQGRA 

RLLLTGLHACGNDLSVALLRHFSCCPEWALA 

SVGCCYMKLSDPGGYPLSQWVAGLPGYELP 

YRLREGACHALEEYAERLQKAOPGLRTHCY 

RAALETVIRRARPELRRPGVQGIPRVHELKIBE 

YVQRGLQRVGLDPQLPLNLAALQAHLAQEN 

RWAFFSLALLLAPLVETLILLDRLLYLQEQA 

LSPVGFHAELLPIFSmSPKNLVLVATKMPLG 

OALSVLETEDS 


819 


2169 


A 


6661 


65 


2686 


SGSGHCLAEAASMGPWGWKLRWTVALLLA 

AAGTAVGDRCERNEFQCQDGKCISYKWVCD 

GSAECQDGSDESQErCLSVTCKSGDFSCX>OR 

VNRCIPQFWRCDGQVDCDNGSDEQGCPPKTC 

SQDEFRCHDGKCISRQFVCDSDRDCLDGSDE 

ASCPVLTCGPASFQCNSSTCIPQLWACDNDPD 

CEDGSDEWPQRCaiQLYVFQGDSSPCSAFEFH 

CLSGECIHSSWRCDGGPDCXDKSDEENCAVA 

TCRPDEFQCSDGNOHGSRQCDREYDCKDMS 

DEVGCVNVTLCEGFNKFKCHSGECITLDKVC 

NMARDCRDWSDEPIKBCGTNECLDNNGGCS 

HVCNDLKIGYECLCPDGFQLVAQRRCEDIDE 

CQDPDTCSQLCVNLEGOYKCQCEEGFQLDPH 

TKACKAVGSIAYUTnrNRHEVRiavrrL^ 

TSLIPNUOJWAUJTEVASNRIYWSDLSQRMI 

CSTQLDRAHGVSSYDTVISRDIQAPDGLAVD 

WIHSNIYWTDSVlXnVSVADTKGVKRKTLFR 

ENGSKPRAIWDPVHOFM Y W 1 D w u i r ajsus. 

KGGLNOVDIYSLVTENIQWPNGrrLDLLSGRL 

YWVDSKLHSISSIDVNGGNRKTILEDEKRJLAH 

PFSLAVFEDKVFWTOIINEAIFSANRLTGSDV 

NLLAENLLSPEDMVLFHNLTQPRGVNWCERT 

TLSNGGCQYLCLPAPQI^a'HSPKFTCACPDGM 

LLARNDMRSCLTEGVEAAVATQETSTVRLKVS 

STAVRTQHTTmVPDTSRLPGATPGLTTVEI 

VTMSHQALGDVAG\RGN\EKKPSSVRALSIVL 

PIVU.LVFLCLGVFLLWKNWRLKMNSINFDNP 

VYQKTTEDEVfflCHNQDGYSYPSRQMVSLED 

DVA 


820 


2170 


A 


6666 


17 


4146 


■ ERGISSQIKGMKSOSGGGSPTSLWGLLFLSAA 
LSLWPTSGEICGPGIDIRNDYQQLKRLENCTVI 
EGYUfflXISICAEDYRSYRFPKLTVITEYU.LF 
RVAGUBSLGDLFPNLTVIRGWKLFYNYALVIF 



252 



wo 01/57188 



PCTAJSOl/03800 



SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

concspondi 

ng to first 

amino add 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
add residue 
of peptide 
sequence 


Amino acid sequence (A'=AlaniQe C=Cysteine, 
I>=Aspartic Acid, E==Glutamic Add, 
F=PhenyIaIanine, G^GJycine, H=Histidine, 
I«Isoleucine, K=Lysinc, LpLcucinc, 
M=Mcthionine. N=Asparagine, P^Prolme. 
Q=<j|utamine. R=Arginine, S=Scrine, 
T=Threonine, V=»Valine, W«Tiyptophan, 
y=Tyrosine, X^Unknown, ♦s^top codon, 
/=^ossible nucleotide deletion, V"possiblc 
nucleotide insertion 














BMmKDIGLYNLRNITRCMIRIEKNADLCYL 

STVDWSLILDAVSNNYIVGNKPPKECGDLCP 

GTMEEKPMCEKTTINNEYNYRCWTTNRCQK 

MCPSTCGKRACTENNECCHPECLGSCSAPDN 

lyrACVACRHYYYAOVCVPACPPNTYRFEGW 

RCVDRDFCANILSAESSDSEGFVIHDGECMQE 

CPSGFlRNGSQSMYCIPCEGPCPKVCEEEKKr 

KTIDSVTSAQMLQGCTIFKGNLLINIRRGNNIA 

SELENFMGLIEWTGYVKIRHSHALVSLSFLK 

NLRLILGEEQLEGNYSFYVLDNQNLQQLWD 

WDHimLTIKAGKMYFAFNPKLCVSEIYRMEE 

VTGTKGRQSKGDINTRNNGERASCESDVLHF 

TSTTTSKNRlirrWHRYRPPDYRDLISFTVYYK 

EAPFKNATTEYDGQDACGSNSWNMVDVDLPP 

NKDVEPGILLHGLKPWTQYAVYVKAVTLTM 

VENDHIRGAKSEILYIRTNASVPSIPU)VLSAS 

NSSSQUVKWNPPSLPNONLSYYIVRWQRQP 

QDGYLYRHNYCSKDKIPIRKYADGTrDIEEVT 

ENPKTEVCGGEKGPCCACPKTEAEKQAEKEE 

AEYRKVFENFLHNSIFVPRPERKRRDVMQVA 

NTTMSSRSRNTTAADTYNrrDPEELETCYPFF 

ESRVDNKERTVISNLRPFTLYRIDIHSCNHEAE 

KLGCSASNFVFARTMPAEGADDIPGPVTWEP 

RPENSIFLKWPEPENPNGLILMYEIKYGSQVE 

DQRECVSRQEYRKYGGAKLNRLNPGNYTARI 

QATSLSGNGSWTDPVFFYVQAKRYENFIHUI 

ALPVAVLLIVGGLVIMLYVFHRKRNNSRLGN 

OVLYASVNPEYFSAADVYVPDEWBVAREKIT 

MSRELGQGSFGMVYBGVAKGWKDEPETRV 

AIKTN^AASMRERIEFLNEASVMKEFNCHH 

VVRLLGVVSQGQPTLVIMELMTRGDLKSYLR 

SLRPEMENNPVLAPPSLSKMIQMAGEIADGM 

AYLNANKFVHRDIj^ARNCMVAEDFrVKIGD 

FGMTRDrYETDYYRKGGKGLLPVRWMSPESL 

KDGVFTTYSDVWSFGWLWEIATLAEQPYQ 

GLSNEQVLRFWMEGOLLDKPDNCPDMLFEL 

MRMCWQYNPKMRPSFLEIISSlKEEMEPaFRii 

VSFYYSEENKLPEPEELDLEPENMESVPLDPS 

ASSSSLPLPDRHSGHKAENGPGPGVLVLRASF 

DERQPYAHMNGGRKNERALPLPQSSTC 


821 


2171 V 


A 


6691 


106 


825 


GRVLFRGCGVGHKGQVLMGTFILAQDWLSE 

SNHVFCVSSMLRLQKRLASSVLRCGKKKVW 

LDPNETNEIANANSRQQIRKLIKDGLIIRKPVT 

VHSRARCRiam-ARRKGRHMGIGKRKGTAN 

ARMPEKVTAVMKRMlsJULKKJJJv^ i Kca/Jviv 1 1\ 

ESKKIDRHMYHSLYLKVKGNVFKNKRILMEH 

IHKLKADKARKKLLADQAEARRSKTKEARK 

RREERLQAKKEEUKTLSKEEETKK 


822 


2172 


A 


6713 


/ /A 


21 


DFRPGLLLPRKKKMFGFHKPKMYRSIEGCXCl 

SGAKSSSS\RFn)SKRYEK\DFQ\SCFGLHETR\ 

SGDI\CNA\CVLL\U»WKKLPAGSKKVNWNH 

VVDARAOPSUJCTTLKPKKVKTL\SGNRIK\ST 

QISICUJKEFKRVHNSDAHSTTSVSASPVAQSPLF 

TVNQFRWTGSDTGVGFPGSNRNHPVFSFLDLX 

TYWKRQKICCGIMYKGRFGEVLIDTHLFKPCC 

SNKKA\AAEKPEEQGPEPLPISTQEWVTEVFM 


823 


2173 


A 


6727 


3 


4063 


PYLATLQLDSSLLEPPKYQTPPAAAQGQATPO 
NAGPLAPNOSAAPPAOSAFNPTSNSSSINFAA 
SSSASGSSVPPVSSSASAPGISQISTTSSSGFSGS 
VGGQNPSTGGISADRTQGNIGCGGDTDPGQS 
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SEQID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 
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beginning 

nucleotide 
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correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue • 
of peptide 
sequence 


Amino acid sequence (A=Aianine 0=Cysteine» 
]>Aspartic Acid, &=Glutamic Acid, 
F=PhenylaIanine, G=Glycine, H=Histidine. 
Msoleucine, K=Lysine, l>=Leucine, 
M==Me&ionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=^eonine, V=Valine, W=Tiyptophan, 
Y»Tyrosine, X=Unknown, *«Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














"SSQPSQDGQESNVPSVGSLADPDYLNTPi^MfS 

TPVTLNSAAPASNSGAGVLPSPATPRFSVPTP 

RTPRTPRTPRGGGTASGQGSVKYDSTDQOSP 

ASTPSTTRPLNSVEPATMQPIPEAHSLYVTLE, 

SDSVMNlFKDRNFDSCaCA(>(MNIKGADVG 

LYIPDSSNEDQYRCTCGFSAIMNRKLGYNSGL 

FLEDELDIFGKNSDIGQAAERRLMWVICQSTFL 

PQVEGTKKPQEPPISLLLLljQNQRrQPFASLN 

FLDYISSNNRQTLPCVSWSYDRVQADNNDY 

WTECFNALEQGRQYVDNPTGGKVDEALVRS 

ATVHSWPHSNVLDISMLSSQDVVRMIXSLQP 

FLQDAIQKXRTGRTWENIQHVQGPLTWQQFH 

KMAGRGTYGSEESPEPLPIPTLLVGYDKDFLT 

ISPFSLPFWERLLLDPYGGHRDVAYIWCPEN 

EALLEGAKTFFRDLSAVYEMCRLGQHKPICK 

VLRDGIMRVGKTVAQKLTDELVSBWFNQPW 

SGEENDNHSRLKLYAQVCRHHLAPYLATLQL 

DSSLLIPPKYQTPPAAAQGQATPGNAGPLAPN 

GSAAPPAGSAFNPTSNSSSTNPAASSSASGSSV 

PPVSSSASAPGISQISTTSSSGFSOSVGGQNPST 

GGISADRTQGNIGCGQDTOPOQSSSQPSQDO 

QESVTERERIGIPTEPDSADSHAHPPAWIYM 

VDPFTYAAEEDSTSGNFWLLSLMRCYTEMLD 

NLPEHMRNSFILQIWCQYMLQTmDEQVFY 

IQYLKSMAfSVYCQCRRPLPTQIHIKSLTGFGP 

AASIEMTLKNPERPSPIQLYSPPFILAPIKDKQT 

ELGETFGEASQKYNVLFVGYCLSHDQRWLL 

ASCTDLHGELLETCWNIALPNRSRRSKVSAK 

KIGLQKLWEWaGIVQ^rrSLPWRWIGRLGR 

LGHGELKDWSILLGECSLQnSKKLKDVCRM 

CGISAADSPSILSACLVAMEPQGSFWMPDAV 

TMGSVFGRSTALNMQSSQLNT?QDASCTHIL 

VFPTSSTIQVAPANYPNEDGFSPNNDDMFVDL 

PFPDDMDNDIGILMTONLHSSPNSSPVPSPGSP 

SGIGVGSHFQHSRSQGERLLSREAPEELKQQP 

LALGYFVSTAKAENLPQWFWSSCPQAQN\QC 

PLFLKASLHHfflSVAQTDELLPARNSQRVPHP 

LDSKTTSDVLRFVLEQYNALSWLTCNPATQD 

RTSCLPVHFWLTQLYNAIMNIL 


S24 


2174 


A 


6732 


2440 


365 


■VEBGLGRRRTPPGORRGPVTPAmjPDSVRR 
RLLPPSSAAAFSSHRHNLLCSRRRGGGGGGG 
GGGGGTIBCRPGITGPTAATSPSGEPGNAASAP 
LSLLSPFPGQITYQHPGVAEPSAYGGRDVAC 
ASLVFGRLQHRGGDRKRGLLGRSSGDAASD 
QPFRCRSQSTAQRLVKQMDFTEAYADTCSTV 
GLAAREGNVKVLRKLLKKGRSVDVADNRG 
WMPIHEAAYHNSVECLQMLINADSSENYIKM 
KTFEGFCALHLAASQGHWKTVQILLEAGADP 
NATTLEETTPLFLAVENGQIDVLRLLLQHGAN 
VNGSHSMCGWNSLHQASFQENAEIIKLLLRK 
GANKECQDDFGrrPLFVAAQYG\KLESL\SILIS 
SGXANVNCQALDKATPLFIAAQEGHTKCVELL 
LSSGADPDLYCNEDSWQLPIHAAAQMGKTKl 
LDLLIPLTNRACDTGLNKVSPVYSAVFGGHE 
DCLEILLRNGYSPDAQACLVFOFSSPVCMAFQ 
KDCEFFGIVNILLKYGAQINELHLAYCLKYEK 
FSIFRYFLRKGCSLGPWNHIYEFVNHArKAQA 
KYKEWLPHLLVAGFDPLDXCNSWIDSVSIDT 
LIFTLEFTNWKTLAPAVERMLSARASNAWIL 
OQHUTVPSLTHLCRLEIRSSLKSERLRSDSYIS 
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seq- 
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SEQID 
NO: of 
peptide 
seq- 
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rrcuicicu 

beginning 
nucleotide 
location 
coirespondi 
ng to first 
amino acid 
residue of 
peptide 
sequence 


iPredicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence CA=Alanine OCysteine. 
I>Aspartic Acid, E^lutamic Acid, 
F=Phenylalanine> G=Glycine, H=Histidine, 
I«Isoleucinc, K^Lysine, L^Leucine, 
M-Mcthionine, N-Asparaginc, P"Prolinc, 
Q=01utamin^ R=Arginine. Sparine, 
T=Threonine, V^Valine, W=Tiyptophan, 
Y=Tyrosine, X=Unknown, ♦=Stop codon, 
hpossMe nucleotide deletion, \=possible 
nucleotide insertion 














QLPLPRSLHNYLLYEDVLRMYEVPELAAIQD 
G 


823 


2175 


A 


6735 


277 


1252 


WMGLFDRGVQMIXTTVOAFAAFSLMTIAVG 

TDYWLYSRGVCKTKSVSENETSKKNEEVMT 

HSGLWRTCCLEGNFKGLCKQIDHFPEDADYE 

AnrAEYFLRAVRASSIFPILSViaFMGGLCIA 

ASEFYKTRHNIILSAGIFFVSAGLSNnGnVYIS 

ANAGDPSKSDSKKNSYSYGWSFYFGALSFllA 

EMVGVLAVHMFIDRHKQLRATARA\TDYLQ 

ASAITRIPSYRYRYQRRSRSSSRSTEPSHSRDA 

SPVGDCGFNTLPSTEISMrrLSRDPLKAATTPT 

ATYNSDRDNSFLQVHNCIQKENKDSLHSNTA 


826 


2176 


A 


6744 


3 


5177 


" SDDLRTGLFQDVQDAESliU-PGVYEVLFYNfi 
TEDCPGMMLWRYPEPRGLTLVRITPVPFNTT 
EDPDISTADLGDVLQDPCSLEYWDELQKVFV 
AFREFNLSESKVCELQLPDINLVNDQKKLVSS 
DLWRIVLNSSQNGADDQSSASESGSQSTCDPL 
VTPTALAACIKVDSCFTPWFVPSLCVSFQFAH 
LEFHLCHHLDQLGTAAPQYLQPFVSDRNMPS 
ELEYMIVSFREPHMYLRQWNNGSVCQEIQFL 
AQAIXIKIXECRNVTMQSVVKPFSIFGQMAVS 
SDVVEKIXDCTVrVOJSVFVNLOQHVVHSLNT 
AIQAWQQNKCPEVEELVFSHFVICNDTQETL 
RFGQVDTDENDLLASLHSHQYSWRSHKSPQL 
LHICIEGWGNWRWSEPFSVDHAGTFIRTIQYR 
GRTASLmCVQQLNGVQKQniCGRQIICSYLSQ 
SDELKWQHYIGQDGQAWREHFDCLTAKQK 
LPSYILENNELTELCVKAKGDEDWSRDVCLE 
SKAPEYSIVIQVPSSNSSIIYVWCTVLTLEFNS 
QVQQRMIVFSPLFIMRSHLPDPinHLEKRSLGL 
SETQnPGKGQEKPLQNlEPDLVHHLTFQAREE 
YDPSDCAVPISTSLIKQIATKVHPGGTVNQILD 
EFYGPEKSLQPIWPYNKKDSDRNEQLSQWDS 
PMRVKLSIWKPYVRTLLIELLPWALLINESKW 
DLWLFEGEKIVLQVPAGKHIPPNFQEAFQIGIY 
WANTNTVHKSVAIKLVHNLTSPKWKDGGNG 
EWTLDEEAFVDTEIRLGAFPGHQKLCQFCIS 
SMVQQGIQIIQIEDKrniNNTPYQIFYKPQLSV 
CNPHSGKEYFRVPDSATFSICPGGEQPAMKSS 
SLPCWDLMPDISQSVLDASLLQKQIMLGFSPA 
PGADSSQCWSLPAIVRPEFPRQSVAVPLGNFR 
ENGFCTRAJVLTYQEHLGVTYLTLSEDPSPRV 
IIHNRCPVKMLIKENlia^ffKFEVYCKKIPSECS 
IHHELYHQISSYPDCKTKDLLPSLLLRVEPLDE 
VTTEWSDAIDINSQGTQWFLTGFGYVYVDV 
VHQCGTVFITVAPEGKAGPILTNTORAPEKIV 
TF/KMFITQLSLAVFDDLTHHKASAELLRLTL 
DNIFLCVAPGAGPLPGEEPVAALFELYCVEIC 
CGDLQLDNQL YNKSNr Hf A V L V L.i^JiivACr i 
QCSKMQSLUSNKELEEYKEKCFKLCraNEG 
KSILCDINEFSFELKPARLYVEDTFVYYIKTLF 
DTYLPNSRLAGHSTHLSGGKQVLPMQVTQH 
ARALVNPVKLRKLVIQPVNLLVSIHASLKLYI 
ASDHTPLSFSVFERGPIFTTARQLVHALAMHY 
AAGALFRAGWWGSLDILGSPASLVRSIGNG 
VADFFRLPYEGLTRGPGAFVSGVSRGTTSFVK 
fflSKGTLTSITNIj^TSIj^RNMDRI^OEEHYN 
RQEEWRRQLPESLGEGLRQGLSRLGISLLGAI 
AGIVDQPMONFQKTSEAQASAGHKAKGVISG 
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nucl- 
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seq- 
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SEQ ID 
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seq- 
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ng to first 
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peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
add residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine OCysteine, 
D^Aspartic Acid, B=Glutamic Acid, 
F«Pheiiylalanine, G=Glycine, H==Histidine, 
I«IsolBucine, K^Lysine, I>Ceucine, 
M-Methionine, N^Aspaiagine, P^Proliae, 
Q=Glutaniine. R^Aii^ine, S=Serine, 
T=^rhreonine. V^Valinc, W==Tryptophan, 
Y=»Tyrosinc, X«4Jnknown, *=Stop codon, 
possible nucleotide deletion, V^ssible 
nucleotide insertion 














VGKGIMGVFTKPIGGAAELVSQTGYGILHGA 

GLSQLPKQRHQPSDWHADQAPNSHVKYVW 

KMLQSLGRPEVHMALDWLVRGSGQEHEGC 

LLLmVUWSVSEDTXJQQAFPVTEIDCAQD 

SKQNNLLTVQLKQPRVACDVEVDGVRERLSE • 

QQYNW-VDYTrKTSCHLAPSCSSMQIPCPVVA 

AEPPPSTVKTYHYLVDPHFAQVFLSKFnvfVK 

NKAUOCGFP 


827 


2177 


A 


6748 


2 




FVGAPlRR6NPyGSP6NPGRtlQGPCHRPRGTK 

ASGVSPTLWRPQAAATGLEMPSSGRALLDSP 

LDSGSLTSLDSSVFCSEOEGEPLALGDCFTVN 

VGGSRFVLSQQALSCFPHTRLGKLAWVASY 

RRPGALAAVPSPLELCaDDANPVDNEYFFDRS 

SQAFRYVLHYYRTGiO-HVMEQLCALSFLQEI 

nvwnmPT QTD^CCRDRYFRRKELSETLDFKK 

rnBDQESQHESEQDFSQGPCPTVRQKLWNIL 

EKPGSSTAARIFGVISnFVGVSIINMALMSAEL 

SWLDLQLLEDLEYVCISWFTGEFVLRFLCVRD 

RaaT.RKVPNIIDIJJUIJTYlTLLVESLSG\SQT 

TQEL\ENVGAHCPGCLRLLRAL\RMLKAWGR 

HSTGLRSLGMTITQCYEEVGLLLLFLSVGISIF 

STVEYFAEQSIPDTTFTSVPCAWWATTSMT 

TVGYODlRPDTrTGKIVAFMCILSGlLVLALPI 

AIINDRFSACYFTLKLKEAAVRQREALJaaTK 

NIATDSYISVNLRDVYARSIMEMLRLKGRER 

ASTRSSGGDDFWF 


828 


2178 


A 


6786 


5672 


1360 


"OTHPASSGPVPLPPAAVSAA'rKEELGEPVPhV 
TASSGFQSMHSSNPKVRSSPSGNTQSSPKSKQ 
EVMVRPPTVMSPSGNPQLDSKFSNQGKQQGS 
ASQSQPSPCDSKSGGHTPKALPGPGGSMGLK 
NGAGNGAKGKGKRERSrSADSFDQRDPGTPN 
DDSDDCECNSADHIKSQDSQHTPHSMTPSNAT 
APRSSTPPHGQTTATEPTPAQKTPAKWYVFS 
TEMANKAAEAVLKGQVETIVSFHIQNISNNK 
TERSTAPLNTQISALRNDPKPLPQQPPAPANQ 
DQNSSQNTRLQPTPPIPAPAPKPAAPPRPLDRE 
SPGVENKLIPSVGSPASSTPLPPDGTGPNSTPN 
NRAVTPVSQGSNSSSADPKAPPPPPVSSGEPPT 
LGENPDGLSQEQLEHRERSLQTLRDIQRMLFP 
DEKEFTGAQSGGPQQNPGVLDGPQKKPEGPI 
QAMMAQSQSLGKGPGPRTDVGAPFGPQGHR 
DVPFSPDEMVPPSMNSQSGTTGPDHLDHMTP 
EQIAWLKLQQEFYEEKRRKPEQVWQQCSLQ 
DMMVHQHGPRGWRGPPPPyQMTPSEGWAP 
GGTEPFSDGINMPHSLPPRGMAPHPNMPGSQ 
MRLPGFAGMINSEMEGPNVPNPASRPGLSGV 
SWPDDVPKIPDGRNFPPOQGIFSGPGRGERFP 
NPQGLSEEMFQQQLAEKQLGLPPGMAMEGIR 
PSMEMNRMIPGSQRHMEPGNNPIFPRIPVEGP 
LSPSRGDFPKOlPr C^MOr uKJCVLJir ljivi v r ovjm 
KGDVNLNVNMGSNSQMIPQKMREAGAGPEE 
MLKLRPGGSDMLPAQQKMVPLPFGEHPQQE 
YGMGPRPFLPMSQGPGSNSGLRNLREPIGPDQ 
RTNSRLSHMPPLPLNPSSNFTSLNTAPPVQRG 
LGRKPLDISVAGSQVHSPGINPLKSPrMHQVQ 
SPMLGSPSGNLKSPQTPSQLAGMLAGPAAAA 
SKSPPVLGSAAASPVHLKSPSLPAPSPGWTSS 
PEPPLQSPGIPPNHKAPLTMASPAMLGKVESG 
GPPPPTASQPASVNIPGNSLPSSTPYTMPPEPTL 
SQOTLSBVflMSR\MSKPAM\PSVSNPGYNHD^ 
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^<WASSDDDSPPARSP^OSM^INMPGMGINT 
QNPRISGPNPWPMPTLSPMGMTQPLSHSNQ 
MPSPNAVGPNIPPHGVPMGPC3LMSHNPIMGH 
GSQEPPMVPQGRMGFPQGFPPVQSPPQQVPFP 
HNGPSGGQGSFPGGMGFPGEOPLORPSNLPQ 
<;<;ADAALCKPGGPGGPDSFTVLGNSMPSVFr 
DPDLQEVIRPGATGJPEFDLSRHPSEKPSQfTLQ 
YFPRGEVPGRKQPQGPGPGFSHMQGMMGEQ 
APRMGLALPGMGGPGPVGTPDIPLGTAPSMP 
GHNPMRPPAFLQQGMMGPHHRMMSPAQST 
MPGQPTLMSNPAAAVGMIPGKDRGPAGLYT 
HPGPVGSPGMMMSMQGMMGPVNRTS 


829 


2179 


A 


6797 


433 


3 


"ASFFNFSICICKIILEVGPPVOHPAHDDVGURH 
GPGGR/GSRSPRSLQCAPGGGRRSGCPAGSSP 
ASTCPPSPGGSGADRFGPSPPPPSREAAPTAG 
AAASSTSSGASCPPVPASSRWGVRSRTRSGSG 
GEREPRDRPSERPRLV 


830 


2180 


A 


6800 


3 


1911 


LPERAFGPRTPRAPRJRRRRRLLLSPPPRPFFI'L 

DREPRAPGPWLCPSRAGTAQDPARIRERRGR 

VAGGAAGPAMELRARGWWLLCAAAALVAC 

ARGDPASKSRSCGEVRQIYGAKGFSSS\DVPQ 

AEISGEHLRJCPQGYTCXrrSEM^NLANRSHA 

ELETALRDSSRVLQAMLATQLRSFDDHFQHL 

LNDSERTLQATFPGAFGELYTQNARAFRDLY 

SELRLYYRGANLHLEETLAEFWARLLERLFK 

QLHPQLLLPDDYLDCLGKQAEALRPF\GEAP\ 

oT?T OT D A TAP AVWA AP\SFVOGLGVAS\DVVR 

KVAQVPLGVPEOSRAVIEAGSYC/ALHCVGVP 

GARPCPDYCRNVLKGCLANQADLDAEWRNL 

LDSMVLITDKFWGTSGVESVIGSVHTWLAEA 

INALQDNRDTLTAKVIQGCGNPKVNPQGPGP 

EEKRRRGKLAPRERPPSGILEKLVSEAKAQL 

RDVQDFWISLPGTLCSEKMALSTASDDRCWN 

GMARGRYLPEVMGDGLANQINNPEVEVDIT 

KPDMmQQIMQLKIMTNRLRSAYNGNDVDF 

QDASDDGSGSGSGDGCLDDLCGRKVSRKSSS 

SRTPLTHALPGLSEQEGQKTSAASCPQPPTFL 

LPLLLFLALTVARPRWR 


831 


2181 


A 


6808 


2 


1522 


ASRHGMTPGALLMLLGALGPPLAPUVRGSEA 

EGRLRBKLFSGYDSSVRPAREVGDRVRVSVG 

ULAQLISLNEKDEEMSTKVYLDLEWTDYRLS 

WDPAEHDGIDSLRJTAESVWLPDWLLNNND 

GNFDVALDISVWSSDGSVRWQPPGIYRSSCS 

lONTITFPFDWQNCTMVFSSYSYDSSEVSLQT 

GLGPDGQGHQEIHIHEGTFIENGQWENIHKPS 

RLIQPPGDPRGGREGQRQEVIFYLIIRRKPLFY 

LVNVIAPCE-riLLAIFVFYLPPDAGEKMGLSIF 

ALLTLTVFLLLLADKVPETSLSVPinKYLMFT 

Nm.VTFSVILSVVVLNl.HHRSPHTHQMPLWV 

RQH'IHKLPLYLIUjKKrM'tUUJl-^Mriirrnv^Dor 

GSGWGRGTDEYFIRKPPSDFLFPKPNRFQPEL 

SAPDLRRFIDGPNRAVALLPELREWSSISYIA 

RQLQEQEDHDALKEDWQFVAMWDRLFLW 

TFTIFTSVGTLWIFLDATYHLPPPDPFP 


832 


2182 


A 


6824 


71 


1079 


' ETMAKNPPENCEDCHILNAEAFKSKKICKSLK 
ICGLWOnjU-TLIVLFWGSKHFWPEVPKKAY 
DMEHTFYSNGEKKKIYMEIDPVTRTEIFRSGN 
GTDETLEVHDFKNGYTGIYFVGLQKCFIKTQI 
KVIPEFSEPEEEIDEN^riTTFFEQSVIWVPAE 
KPIEhm)FLKNSKILEia)NVTMYWVINPTL\^ 
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GTFAKQLHHNFAFIILVSELQDFEEEGEDLHFP 

ANEKKGIEQNEQWWPQVKVEKTRHARQAS 

EEEUTNDrreNGIEFDPMLDERGYOCIYCRR 

GNRYCRRVCEPLLGYYFYPYCYQGGRVICRV 

IMPCNWWVARMLGRV 


833 


2183 


A 


6846 


116 


602 


EAEGEQVCOAKCCGDAPHVENREEETARIGP 

GVMESKEERAUWLIVENVNQENDEKDEKE 

QVANKGEPLALPLNVSEYCVPRGNRRRFRVR 

QPILQYRWDIMHRLGEPQARMREENMERIGE 

EVRQLMEKLREKQLSHSLRAVSTDPPHHDHH 

DEFQLMP 


834 


2184 


A 


6831 


3 


2024 


PNGVAlXHLPGAAVJPhTTNYMFQDALGGRSR 

GSREESPAPSRAPASASLWRRLVWEAKMAA 

HAAAAAQAAAAQAAHAEAADSWYLALLGF 

AEHFRTSSPPKIRLCVHCLQAVFPFKPPQRIEA 

R-raLQLGSVLYHHTKNSEQARSHLEKAWLIS 

QQIPQFEDVKFEAASLLSELYCQENSVDAAKP 

LLRKAIQISQQTPYWHCRIXFQLAQLHTLEKD 

LVSACDLLGVGAEYARWGSEYTOALFLLSK 

GMLLUVIERKUJEVHPIXTIXXJQIVENWQGN 

PIQKESLRVFFLVLQVTHYLDAGQVKSVKPC 

T nnPiftTlQTI HnnFTl PSNPADLFHWLP 

KEHMCVLVYLVTVMHSMQAGYLEKAQKYT 

DKAU^QLBKUCMIJ^CSPILSSFQVILLEHnM 

CRLVTGHKATALQHSQVCQLCQQSPRLFSN 

HAAQLHTLLGLYCVSVNCMDNAEAQFTTAL 

RLTNHQELWAFIVTNLASVYIREGNRHQEVVX 

LYSLLERINPDHSFPVSSHaRAAAFYVRGLF 

SFFQGRYNEAKRFLRETLKMSNAEDLNRLTA 

CSLVLLGHIFYVLGNHRESNNMWPAMQLAS 

iaPDMSVQLWSSALLRDLNKACGNAMDAHE 

AAQMHQNFSQQLLQDHIEACSLPEHNLITWT 

DGPPPVQFOAQNGPNTSLASLL 


835 


2185 


A 


6855 


334 


1268 


PTRRPILPLTSPKAISVPSPLQGKQHTLVKSCL 

SVSGIGOaVSLSSRMKLQTLAVSVTALKFWS 

AYVPCQTQDRDALRLTLEQIDLIRRMCASYSE 

LELVTSAKALNDTQKLACXIGVEGGHSLDNS 

LSILRTFYMLGVRYLTLTHTCNTPWAESSAK 

GVHSFYNNISGLTDFGEKWAEMNRLGMMV 

DLSHVSDAVARRALEVSQAPVIFSHSAARGV 

CNSARNVPDDILQLLEEERWAFVMVSLFHGE 

LIQWQPIRPMCSTVADHFDHIKAVMGSKFIGI 

GGDYDGAGKYRKKTTCICAPWRTSSRMSS 


836 


2186 


A 


6862 


315 


n 


PPRSRPSCWRKKVGPGRPWWWGGTGPPCjQCj 
RPEIRLLPLPMTGACGAVAASRTGSSGPG/SSL 
PNGHGGKGSGLANGLAGNP\GHLGLOSSFGT 
GPGSGRPPP 


837 


2187 


A 


6863 


2 


1615 


VLRGQRGPAGGLAEERRRGRNEWRIHD V IT 

APFPGLVQRRSRLLIVSQVRYFLKNKVSPDLC 

NEDGLTALHQCCIDNFEEIYKLLLSHGANVN 

AKDNELWTPLHAAATCGHINLVKILVQYGA 

DLLAVNSDGNMPYDLCEDEPTLDVIETCMAY 

QGITQEKINEMRVAPEQQMIADIHCMIAAGQ 

DLDWIDAQGATLLHIAGANGYLRAAELLLDH 

GVRVDVKDWDGWEPLHAAAFWGQMQMAE 

LLVSHGANU.NARTSMDEMPIDLCEEEEFKVL 

LLELK\HKHDVIMKSQLRHKSSLSRRTSHRQA 

S/SVGKVVRRTQPVGTGPNL\YRKEYE/GEEAI 

LWQRSA\AEDQRTSTYNGDIRE1\RTDQENKD 
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PNPRLEKNPVLLSEFPTKIPRGELDMPVENULK 

APVSAYQYALANGDVWKVHEVPDYSMAYG 

NPGVADATPPWSSYKEQSPQTLLELKRQRAA 

AKLLSHPFLSTHLGSSMARTOESSSEGKAPLI 

GGRTSPYSSNGTSVYYTVTSGDPPLLKFKAPI 

EEMEEK.VHGCCRIS 


838 


2188 


A 


6865 


6291 


739 


AGPLEPRVQGAMALQLWALTLLGLLGAGAS) 

LRPRKLDFFRSEKELNHLAVDEASGWYLGA 

VNALYQLDAKLQLEQQVATGPVLDNKKCTP 

PIEASQCMEAEMTDNVNQLLLVDPPRKRLVE 

CGQLLKGIXCALRALSNISLRLFYEDGSGEKSF 

VASNDEGVATVGLVSSTGPGGDRVLFVGKG 

NGPHDNGUVSTRLLDRTDSREAFEAYTDHAT 

YKAGYLSTNTQQFVAAFEDGPYVFFVFNQQD 

KHPARNRTLXARMCREDFNYYSYLEMDLQC 

RDPDIHAAAFGTCLAASVAAPGSGRVLYAVF 

SRDSRSSGGPGAGLCLFPLDEVHAKMEANRN 

ACYTGTREARDIFYKPFHGDIQCGGHAPGSSK 

SFPCGSEHLPYPLGSRDGLRGTAVLQRGGLN 

LTAVTVAAENNHTVAFLGTSDGRILKVYLTP 

DGTSSEYDSILVEINKRVKRDLVLSGDLGSLY 

AMTQDKVFRLPVQECLSYPTCTQCRDSQDPY 

CGWCVVEGRCTRKAECPRAEEASHWLWSRS 

KSCVAVTSAQPQNMSRRAQGEVQLTVSPLPA 

LSEEDELLCLFGESPPHPARVEGEAVICNSPSS 

IPVTPPGQDHVAVTIQLLLRRGNIFLTSYQYPF 

YDCRQAMSLEENLPCISCVSNRWTCQWDLR 

YHECREASPNPEDGIVRAHMEDSCPQFLGPSP 

LVIP^INHETDVNFQGKNLDTVKGSSLHVGSD 

LLKFMEPVTMQESGTFAFRTPKLSHDANETL 

PLHLYVKSYGKNIDSKLHVTLYDCSFGRSDC 

SLGRAANPDYRCAWCGGQSRCVYEALCNTT 

SECPPPVTTRIQPETGPLGGGIRmLGSNLOVQ 

AGDIQRISVAGKNCSFQPERYSVSTRIVCVIEA 

AETPFTGGVEVDVFGKLGRSPPNVQFTFQQP 

KPLSVEPQQGPQAGGTTLTIHGTHLDTGSQED 

VRVTLNGVPCKVTKFGAQLQCVTGPQATRG 

QMLLEVSYGGSPVPNPGIFFTYRENPVLRAFE 

PLRSFASGGRSINVTGQGFSLIQRFAMW1AEP 

LQSWQPPREAESLQPMTVVGTDYVFHNDTK 

WFLSPAVPEEPEAYNLTVLIEMDGHRALLRT 

EAGAFEYVPDPTFENFTOGVKKQVNKLIRAR 

GmNKAMTLQEAEAFVGABRCTMKrLTET 

DLYCEPPEVQPPPKRRQKRDTTHNLPEFIVKF 

GSREWVLGRVEYDTRVSDVPLSLILPLVIVPM 

WVIAVSVYCYWRKSQQAEREYEKIKSQLEG 

LEESVRDRCKKEFTOLMIEMEDQTNDVHEAG 

IPVLDYKTYTDRVFFLPSKDGDKDVMITGKL 

DIPEPRRPWEQALYQFSNLLNSKSFLINFIHT 

L\ENQPEFSARAKVYFASLLTVALHGKLEYYT 

DIMHTLFLELLEQYVVAKIOTCLMLRRSETVV 

ERMLSNWMSICLYQYLKDSAGEPLYKLFKAI 

KHQVEKGPVDAVQKKAKYTLNDTGLLGDD 

VEYAPLTVSVIVQDEGVDAIPVKVLNCDTISQ 

VKEKIIDQVYRGQPCSCWPRPDSWLEWRPG 

STAQE.SDLDLTSQREGRWKRVNTLMHYNVR 

DGATLELSKVGVSQQPEDSQQDLPGERHALL 

EEENRVWHLVRPTDEVDEGKSKRGSVKEKE 

RTKATTEIYLTRLLSVKGTLQQFVDNFFQSVL 

APGHAVPPAVKYFFDFLDEOAEKHNIQDEDn 
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HIWKTNSU»LRFWVNIUa^HFiroVHVHEW 

DASLSmQTFMDACTRTEHKLSRDSPSNKLL 

YAKEISTYKKMVEDYYKGIRQMVQVSDQDM 

NTHLAEISRAirrDSLNrLVALHQLYQYTQKY 

YDEIINALEEDPAAQKMQLAFRLQQIAAALE 

NKVTDL 


839 


2189 


A 


6872 


1 


1483 


RARRLALQCHVCVCALTPGEQSGRRLPGQT 

WLMFSCFCFSLQDNSFSSTTVTECDEDPVSLH 

EDQTDCSSLRDENNKEaOYPDAOALVEEHAPP 

SWEPQQQNVEATVLVDSVLRPSMGNFKSRKP 

K STFK AESGRSHGESOEratrVVSSQSECQVRA 

GTPAHESPQNNAFKCQE'nVRL\QPRIDQRTAT 

SPKDAFETR\QDLNEEEAAQVHGVKDPAPAS 

TQSVLAVDGTDSADPSPVHKDGQNEADSAPE 

DLHSVGTSRLLL/YHITDGDNPTAVRHGCSL/F 

SGQSQRFNLDFESAPSPPSTXJQFMMPRSSSRC 

SCGDGKEPQTITQLTKHIQSLKRKIRKFEEKFE 

QEKXYRPSHGDKTSNPEVLKWMNDLAKGRK 

QUCELKUCLSEEQGSAPKGPPKNLLCEQPTVP 

RENGKPEAAGPEPSSSGEETPDAALTCLKERR 

EQLPPQEDSKVTKQDKNLIKPLYDRYRIIKQIL 

STPSUPTIVSODTCMLLLCroV 


840 


2190 


A 


6873 


2 


"2054 


nTRFYFSFIRLFAMSLADLTKTNIDEHFFGVAL 

ENNRRSAACKRSPQTGDFSRNSNASNKSVDY 

SRSQCSCGSLSSQYDYSBDFLCDCSEKAINRN 

YLKQPWKEKEKKKYNVSKISQSKGQKEISV 

EKKHTWNASLFNSQIHMIAQRRDAMAHRILS 

ARLHKIKGLKNELADMHHKLEAILTENQFLK 

QLQLRHLKAIGKYENSQNNLPQIMAKHQNEV 

KNLRQLLRKSQEKERTLSRKLRETDSQLLKT 

KJDILQALQKLSEDKNIAEREELTHKXSIirnC 

MDANDKKIQSLEKQLRLNCRAFSRQLAIETR 

KTIJWIOTATKTLOVEVKHLQQKLKJEKDREL 

EIKKIYSHRILKNLHDTEDYPKVSSTKSVQAD 

RKILPFTSMRHQGTQKSDVPPLTTKGKKATG 

NIDHKEKSTEINHEIPHCVNKLPKQEDSKRKY 

EDLSGEEKHLEVQILLENTGRQKDKKEDQEK 

KNIFVKEEQELPPKUEVIHPERESNQEDVLVR 

EKFKRSMQRNGVDDT\LGKGTAPYTKGPLRQ 

RRHYSFTEATENLHHGLPASGGPANAGNMR 

ySHSTGKHLSNREEMELEHS\DSGYEPSFGKS 

SRIKVKD1TFRDKKSSLMEELFGSGYVLKTD 

QSSPGVAKGSEEPLQSKESHPLPPSQASTSHA 

FGDSKVTWNSIKPSSPTEGKRKin 


841 


2191 


A 


6874 


3 


2867 


SSRTREMEEKEILRRQlRLLQGLIDDYKTLHCi 

NAPAPGTPAASGWQPPTYHSGRAFSARYPRP 

SRRGYSSHHGPSWRKKYSLVNRPPGPSDPPA 

DHAVRPLHGARGGQPPVPQQHVLERQVQLS 

QGQNWKVKPPSKSGSASASGAQRGSLEEFE 

DTPWSDQRPREGEGEPPRCiQLQrbKFi KAKvj 

TCSVEDPLLVCQKEPGKPRMVKSVGSVGDSP 

REPRRTVSESVIAVKASFPSSALPPRTGVALG 

RKLGSHSVASCAPQLLGDRRVDAGHTDQPVP 

SGSVGGPARPASGPRQAREASLWTCRTNKF 

RKNNYKWVAASSKSPRVARRALSPRVAAEN 

VCKASAGMANKVEKPQLIADPEPKPRKPATS 

SKPGSAPSKYKWKASSPSASSSSSFRWQSEAG 

SKDHASQLSPVLSRSPSGDVRPALAHSGLKPLS 

GETPLSAYKVKTRTKIIRRRGSTSLPGDKKSG 

TSPAATAKSHLSLRRRQAIJRGKSSPVLKKTPN 
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KGLVQVTiaiRU:RlJ^PSRAHLPTKEASSLHA 

VRTAPTSKVIKTRYRIVKKTPASPLSAPPFPLS 

LPSWRARKLSLSRSLVLNRLRPVASGGGKAQ 

PGSPWWRSKGyRCI0GVLYKV5ANliXois.i 5u 

QPSDAGSRPLLRTGRLDPAGSCSRSLASRAVQ 

RSLAIIRQARQRREKRKEYCMYYNRFGRCNR 

GERCPYIHDPEKVAVCTRFVROTCKKTDGTC 

PFSHHVSKEKMPVCSYFLKGICSNSNCPYSHV 

YVSRKAEVCSDFLKGYCPLGAKCKKKHTLLC 

PDFARRGACPRGAQCQLLHRTQKRHSRRAAT 

SPAPGPSDATARSRVSASHGPRJKPSASQRPTR 

'qtpssaaltaaavaapphcpggsaspssskas 

SSSSSSSSPPASLDHEW>SLQEAALAAACSNR 

lcklpsfislqsspspgaqprvraprapltkds 
gkplhikprl 


842 


2192 


A 


689S 


506 


2071 


WPDLVHTWSSEEAMGSCCSCPDKDTVl'UNtl 

RNKFKVINVDDDGNELGSGIMELTDTBLILYT 

RKRDSVKWHYLCLRRYGYDSNLFSFESGRRC 

QTGQGIFAFKCARAEELFNMLQEIMQNNSIN 

VVEEPVVERNNHQTELEVPRTPRTPTTPGFAA 

QNLPNGYPRYPSFGDASSHPSSRHPSVGSARL 

PSVGEESTHPLLVAEEQVHTYVNTTGVQE^ 

knrtsvhvplearvsnaesstpkeepssiedr 

DPQILLEPEGVKFVLGPTPVQKQLMEKEKLE 

OLGRDQVSGSGANNTEWDTGYDSDERRDAP 

SVNKLVYENINGLSIPSASGVRRGRLTSTSTSD 

TQNINNSAQRRTALLNYENLPSLPPVWEARK 

LSRDEDDNUjPKTPSLNGYHNNLDPMHNYV 

OTENVTVPASAHKIEYSRRRDCTPTVFNFDIR 

RPSLEHRQLNYIQVDLEGGSDSDNPQTPKTPT 

TPLPQTPTRRTELYAVIDIERTAAMSNLQKAL 

PRDDGTSRNKTRHNSTADLPL 


843 


2193 


A 


6919 


2 


'm 


AORPGTTHASOWWlAVQbLRLb V Uilf f V 6iUk 
YTTACm-TTAAVQLEUTPFQLYFNPELIFKHF 
OlWRLITNFLFFGPVGFNFUmilFLYRYCRM 
LEEGSFRGRTADFVFMFLFGGFLKflLFCSLFVS 
lyVFLGPGLYNN/GSSMOGAEUEPLCPHELLRP 
SQLPGPLSALGAHGIFLWGELNHCGPFGYCS 
WTHIFFLGRCISOSTWWNKNSENTIYFESYF 


844 


2194 


A 


6928 


902 


366 


" "HRLCMPIQGACGERME/FSLLLPGLEUNUVii- 
AHCNLRLPGSSNSPASASQVAGITGVCHHAR 
LIFVFSVETGFLHAGQAGLELLTSGDPPASAS 
QSAGITGKSQHTRPGYEFnPYSAAQEDALKA 


845 


2195 


A 


6939 


"1660 " 


317 


■ LYPENLGESLFPILLLPPPWPDGGRPC(;v^i^a^> 
TRAKKLRRIWRILEEKESVAGAVQTLLLRSQE 
GGV\TSAAASTLSEPPRRTQESRTRTRALGLPT 
LPMEKLAASTEPQGPRPVLGRESVQVPDDQD 
FRSFRSECEAEVGWNLTYSRAOVSVWVQAV 
EMDRTLHKKO^MECCDVPAETLYDVLHDIE 
YRICKWDSNVIETFDIARLTVNADVGYYSWR 
CPKPLKNRDVITLRSWLPMGADYIIMNYSVK 
HPKYPPRKDLVRAVSIQTGYLIQSTGPKSCVn 
YLAOVDPKGSLPKWWNKSSQFLAPKAMKK 
MYKACLKYPEWKQKHLVPHFKPWLVHPEQSP 
LPSLALS\ELSVQHADS\LENIDESAVV\ESREE 
RVMGGAGGEG\SDDDTSLYAEAPHRFRETETG 
PGAGRALGAAAAPALSPLHPPGTWWHRABP 
RRVLQPGWTEPO 
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seq- 
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SEQ ID 
NO: of 
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seq- 
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hod 
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beginning 

nucleotide 
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ng to first 

amino acid 

residue of 

peptide 
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Predicted end 
nucleotide 
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to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A«Alanine OCysteine, 
I>>Aspartic Add, EpGhitamic Add, 
P»Piienylaianioe, G=Gtydne» H'^Histidine, 
Msoleucine, K=Cyslnc, D=Leudne, 
M=Metliionine, N=Asparngine, Ps=Proline, 
Q=Ghitamine, R«Argininc» S=Serinc, 
T^Threonine, V=Varmc, W=Tiyptophan, 
Y=TyroanCs X=Unlcnown, ♦•^top codon, 
^^ssible nudeotide dektion, V*possible 
nudeotide insertion 


846 


2196 


A 


6944 


42 


2672 


RJRKMAGCRGSLCCrCRWCCCCGERK'rRlPE 

ELTUGETQEEEDEILPRKDYESLDYDRCINDP 

YLEVLETMDNKKGRRYEAVKWMWFAIGV 

CTGLVGLFVDFFmFTQLKFOVVQfTSVEECS 

QKGCLALSaELLGFNLlFVFLESLLGLIEPVE 

AGSGITEGKCYLYARQVPGLVRLPTLLWKAL 

GVLLTVAAMLU\GLGSPMIHSOSWGAGLPQ 

FQSISLRKIQFNFPYFRSDRYGKVDKRDFVSAG 

AAAGVAAAFGAPIGGTLFSLEEGSSFWNQGL 

TWKVLFCSMSATFTLNFFRSGIQFGSWGSFQL 

PGLLNFGEFKCSDSDKKCHLWTAMDLGFFV 

VMGVIGGLLGATFNO-NKRI^YRMRNVHP 

KPKLVRVLESLLVSLVTTVVVFVASMVLGEC 

RQMSSSSQIGNDSFQLQVTEDVNSSnCTFFCP 

NDTYNDMATLFFNPQESAILQLFHQDGTFSPV 

ILALFFVLYFLLACWTYGISVPSQLFVPSLLC 

OAAFORLVANVLKSYlGLOHIYSOTrALIGAA 

AFLGGVVRMTlSLTVILIEST\NErrYGLPIMVT 

LMVGKWTGDFFNKGIVYDIHVGLRGVPLLEW 

ETEm^DKLRASDIMEPNLTYVYPHTRIQSLV 

SILRTTVHHAFPVVTENRGNEKEFMKGNQLIS 

NNIKFKKSSE.TRAGEQRKRSQSMKSYPSSEL 

RNMCDEHIASEEPAEBCEDLLQQMLERRYTPy 

PNLYPDQSPSEDWTMEERFRPLTFHGLILRSQ 

LVTLLVRGVCYSESQSSASQPRLSYAEMAED 

YPRYPDMDLDLTLLNPRMIVDVTPYMNPSPF 

TVSPNTHVSQVFNLFRTMGLRHLPVVNAVGE 

IVGUTRHNLTYEFLQARLROHYQTI 


847 


2197 


A 


6951 


3 


1994 


NTNSSSVTNSAAGVEDLNIVQVTVPDNEKER 

LSSIEKIKQLREQVNDLFSRKFGEAIGVDFPVK 

VPYRKTITNPGCVVIDGMPPGVVFKAPGYLEI 

SSMRRILEAAEFIKFTVIRPLPGLELSNGEYST 

VGKRKIDQEGRVFQEKWERAYFFVEVQNIST 

CLICKRSMSVSKEYNLRRHYQTNHSKHYDQY 

MERMRDEKLHELKKOLRKYLLGLSDTECPE 

QKQVFANPSPTQKSPVQPVEDLAGNLWEBXR 

EKIRSFVAYSIAIDEITDIN^r^^QLAIFIRGVDE 

•vnyrxirccrcT t nT\rD"\>rmTfcrQn'KIPTl?<!'HVF'SfST.lf 
NFDVSlibLLU 1 VrM l\Ji JPKOVJlNrur oJs.y Ctsu^un, 

NFCINWSKLVSVASTGTPPMVDANNGLVna 

KSRVATFCKGAELKSICCnHPESLCAQNKLKM 

nHVMnwvK! 9 VNWrrSRGLNHSEFTTLLYEL 

DSQYGSLLYYTEIKWLSRGLVLKRPFESLEEI 

DSFMSSRGKPLPQLSSIDWIRDLAFLVDMTM 

HLNALNISLQGHSQIVTQMYDLIRAFLAKLCL 

WETHLTRNNLAHFPTLKLVSRNESDGLNYIP 

KIAELKTEFQKRLSDFKLYESELTLFSSPFSTKI 

DSVHEELQMEVIDLQCNTVLKTKYDKVGBPE 

FYKYLWGSYPKYKHHCAKILSMFGSTYICEQ 

LFSIMKLSKTKYCSQLKDSOWDSVLHIAT 


848 


2198 


A 


6985 


3 


289 


SVQYLPGRPTRTHASTDAPLMLKFTPLPSKTK 
ASAPVQCLLLMAATFSPQGLAKPHSGTIPmC 
CFNAINTKIPIQRLESYTRITNIQCPKEAVM 


849 


2199 


A 


6999 


963 


5 


LDFLCHRDMGDNTTSITEFLLLGFPVGPRIQM 
LLFGLFSLFYVFaLGNGTILGLISLDSRLHAP 
MYFFLSHLVAVVDIAYACNTVPRMLVNLLHP 
AKPISFAGRMMQTFLFSTFAVTECLLLWMS 
YDLYV\AICHPUlYLAINnmVCITLAVTSWT 

tgvllslihlvlllplpfcrpqkiyhffceila 

vlklacadte™enmvlagaisglvgplstiv 

vsymcilcailqiqsrevqrkafcrcfshlcvi 
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nucl- 
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seq- 
uence 


SEQ ID 
NO: of 
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seq- 
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Met 
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in 
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rTeoicieu 

beginning 

nucleotide 
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correspond! 

ng to first 

amino add 

residue of 
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sequence 


J^reuivicu cuu 

nucleotide 
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Amino &cid seauence f A^Alsnine 0=Cysteine. 
D»Aspartic Add, EF<jlutamic Acid, 
F»Phenylalanine, G^lycioe, I^Histidine, 
I=>Isoleucine, K^Lysine, LpLeudne, 
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T=Thrconinc, V«Valine, W'=^iyptophan, 
Y=Tyrosine, X=Unknown, ♦=Stop codon, 
A^ossible nucleotide deletion, V=possible 
nucleotide insertion 














GLFYGTAIIMYVGPRYGNPKEQKKYLLLFHS 
LFNPMLNPLICSLRNSEVKNTLIGIVLGVERAL 


S50 


2200 


A 


7001 


1 


1011 


MGNDSVSYEYGDYSDLSDRPVDCLDGACLAI 

DPLRVAPLPLYAAIELVGVPGNAMVAWVAG 

KVARRRVGATWLLHLAVADLLCCLSLPILAV 

piargghwpygavgcralpsnlltmyasvll 

ij\alsadu:flalgpaw\clrfs/gacgvqva 

(xuawtlau-ltvpsaiyrrlhqehfparlq 

cwdyggssstenavtairelfgflgplvava 

schsallcwaarrcrplgtaiwqffvcwap 

yhllglvltvaapnsallaralraeplivgl 

AIjUISCXNPMIJLYTORAQLRRSLPAACHW 

alresqgqdesvdskkstshdlvsemev 


851 


2201 


A 


7011 


1 


2310 


aaasplrmsrkgpraeVcAdcSapdpgwasi 

srgvlvcdeccsvhrslgrhisivkhlrhsa 

wpptllqmvhtlasnoansiwehsixdpaqv 

qsopalkqtpkdkwhpksefirakyqmlaf 

vhklpc31dddgvtakdlskqlhssvrtgnle 

tcija.lsuiaqanffhpekgttplhvaakag 

qtlqaellwygadpgspdvngrtptoyarq 

aghhelaerlvecqyeltdrlafylcgrkpd 

hknghyiipqmadsldlselakaakkklqal 

snrlfeelamdvydbvdrrendavwlatqn 

hstlvtersavpflpvnpeysatrnqgrqkl 

arfnarefatlndblseakrrqqgkslssptd 

nlelslrsqsdlddqhdydsvasdedtdqep 

LRSTGAraSNRARSMDSSDLSDGAVTLQEYL 

ELKKALATSEAKVQQLMKVNSSLSDELRRLQ 

REMKLQAENLQLRQPPGPVPTPPLPSERAEH 

TPMAPGGSTHRRDRQAFSMYEPGSALKPFGG 

PPGDELTTRLQPFHSTELEDDAIYSVHVPAGL 

YRIRKGVSASAVPFTPSSPLLSCSQEGSRHTSK 

LSRHGSGADSDYENTQSGDPLLGLEGKRFLE 

LGKEEDFHPELESLDGDLDPGLPSTEDVILKT 

EQVTKNIQELLRAAQEFKHDSFVPCSEKIHLA 

VTEMASLFPKRPALEPVRSSLRLLNASAYRLQ 

SECRKTVPPEPGAPVDFQLLTQQVIQCAYDIA 

KAAKQLVrnTREKKQ 


&S2 


2202 


A 


7016 


484 


1777 


■ RISKIQVYYSTGYSSRKMNPTLGLAIFLAVLL 
TVKGLLKPSFSPRNYKALSEVQGWKQRMAA 

vcr A Oi^xTX/irM nnvi T Wl ATrVKTPnP'MTPI 
JiJlLAKV^WMlyLUrJ\JLJL*l\J\^ I fNrvJrvnirJUor 

LSISTAFSMLCLGAQDSTLDEIKQGFNFRKMP 

EKDLHEGFHYIIHELTQKTQDLKLSIGNTLFID 

QRLQPQRKFLEDAKNFYSAETILTNFQNLEM 

AQKQINDFI/ESKTHGKINNLIENIDPQTVMLL 

ANYIFFRARWKHEFDPNVTKEEDFFLEKNSS 

VKVPMMFRSGIYQVGYDDKLSCTILEIPYQK 

NITABFDLPDEGKLKHLEKGLQVDTFSRWKTL 

LSRRWDVSVPRLHMTGTFDLKKTLSYIGVS 

KIFEEHGDLTKIAPHRSLKVGEAVNKAELKM 

DERGTEGAAGTGAQTLPMETPLWKIDKPYL 

LLIYSEKEPSVLFLGKIVNPIGK 


8S3 


2203 


A 


7017 


1 


3293 


MTHACNPSTLGGQGRRITRSHGRRRSSRGPV 

ARHVAAGAGHENKHGGSRRFPAGVAPRRAM 

ANVSKKVSWSGRDRDDEEAAPLLRRTARPG 

GGTPLLNGAGPGAARQSPRSALFRVGHMSSV 

ELDDELLEP\DMDPPHPFPKEIPHNEKLLSLKY 

ESLDYDNSENQLFLEEERRINHTAFRTVEIKR 

WVICALIGILTGLVACFIDIVVENLAGLKYRVI 

KGSILPNIDKFTEKGGLSFSLLLWATLNAAFV 
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Amino acid sequence (A^Alaniae OCy steine, 
I>Aspartic Acid; E^lotamic Acid, 
F°Phenylaianinei G^lycine, H=Histidiiie, 
I»Isoleucine, K^Lysine, L^Leucine, 
M-Methionine, NsAsparagine, P=ProliDe, 
Q=Glutamln^ R=Arginine, S=Scrine, 
.T=Threonin^ V=Valine, W«=Tryptophan. 
Y»Tyrosine, X»Unknown, *sStop codon, 
/=possible nucleotide deletion, V^ossible 
nucleotide insertion 














■ LVOSVI VAFIEPVAAGSOIPQIKCFLNGVKIPH 
WRLKTLVKVSGVILSVVOOLAVGKEGPMI 
HSGSVIAAGISQGRSTSLBOlDFKIFEyERRDTE 
KRDFVSAGAAAGVSAAFGAPVGGVLFSLEEG 
ASFWNQFLTWRIFFASMISTlTrLNFVLSIYHG 
NMWDI^SPGLINFGRroSEKMAYnHEIPVFI 
AMGWGGVLGAVFNALNYWLTMFRIRYIHR 
PCLQVIEAVLVAAVTATVAFVLIYSSRDCXJPL 
QGGSM5YPLQLFCADGEYNSMAAAFFNTPEK 
SWSLFHDPPGSYNPLTLGLFTLVYEFLACWT 
YGLTVSAGVFIPSLLIGAAWGRLFGISLSYLTa 
AAIWADPGKYALMGAAAQ1X3GIVRMTLSLT 
VIN4MEATSNVTYGFPIMLVLMTAKIVGDVFIE 
GLYDMHIQLQSVPFLHWEAPVTSHSLTAREV 
MSTPVTO-RRREKVGVIVDVLSDTASNHNGF 
PWEHADDTQPARLQGLILRSQLIVLLKHKVF 
VERSNLGLVQRRLRLKDFRDAYPRFPPIQSIH 
VSQDERECTMDLSEFMNPSPYTVPQEASLPR 
VFKLFIUaGIilHL\nnn>NRNQWGLVTRKD 
LARYRLGKRGLEELSLAQTGPKAQATAEGRV 
AGAAQQPCXJLRAVTLEDUSIXLAGGLASPEP 
LSLEELSERYESSHPTSTASVPEQDTAKHWNQ 
LEQWWELQAEVACLREHKQRCERATRSLL 
REIXQVRARVQU3GSELRQLQQEARPAAQAP 
EKEAPEFSGLQNQMQALDKRLVEVREALTRL 
RRRQVQQEAERRGAEQEAGLRLAKLTDLLQ 
QEEQGREVACGALQKNQEDSSRRVDLEVAR 
M 


854 


2204 


A 


7037 


139 


2604 


AGTWEPRPYDQAKETGAPGSQPPVPPMELRP 

WLLWWAATOTLVLLAADAQGQKVFTNTW 

AVRIPGGPAVANSVARKHGFLNLGQIFGDYY 

HFVraRGVTKRSLSPHRPRHSRLQREPQVQWL 

EQQVAKRRTKRDVYQEPTDPKFPQQWYL\SG 

VTQ\RDLMVKAAWAQGYTGHGrvVSILDDGI 

EKNHPDLAGNYDPGASFDVNDQDPDPQPRY 

TQMNDNRHGTRCAGEVAAVANNGVCOVGV 

AYNARIGGVRMLDGEVTDAVEARSLGLNPN 

HIHIYSASWGPEDDGKTVDGPARLAEEAFFR 

GVSQGRGGLGSIFVWASGNGGREHDSCNCD 

GYTNSIYTLSISSATQFGNVPWYSEACSSTLA 

TTYSSGNQNEKQIVTTOLRQKCTESHTGTSAS 

APLAAGIIALTLEANKNLTWRDMQHLWQTS 

KPAHLNANDWATNGVGRKVSHSYGYGLLD 

GKRLEVRKTVTACLGEPNHITRLEHAQARLT 

LSYNRRGDLAIHLVSPMGTRSTLLAARPHDY 

SADGFNDWAFMTTHSWDEDPSGEWVLEIEN 

TSEANNYGTLTKFTLVLYGTAPEGLPVPPESS 

GCKTLTSSQACWCEEGFSLHQKSCVQHCPP 

GFAPQVLDTHYSTENDVETIRASVCAPCHAS 

CATCQGPALTDCLSCPSHASLDPVEQTCSRQS 

QSSRESPPQQQPPRLPPEVEAGQRLRAGLLPS 

HIPEWAGLSCAFIVLVFVTVFLVLQLRSGFS 

FRGVKVYTMDRGLISYKGLPPEAWQEECPSD 

SEEDEGRGERTAFDCDQSAL 


855 


2205 


A 


7058 


3 


1441 


■bRPASQLLAPFAAEALPGAPRAAMAQHhSLA 
ACDWGFDLDHTLCRYNLPESAPLIYNSFAQF 
LVKEKGYDKELLNVTPEDWDFCCKGLALDL 
EDGNFLKLANNOmRASHGTKMMTPEVLA 
EAYGKKEWKHFLSDTGMACRSGKYYFYDN 
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seq- 
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NO: of 
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seq- 
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ID NO: 
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09/496 
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914 


ng to first 




amino acid 




residue of 




peptide 
sequence 
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nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 



Amino acid sequence (A«Alanine CXJysteme, 
D-Aspartic Acid, EHjlutamic Acid, 
F=Phenylalanine, G=Glycinc. H=Histidine» 
I=Isoleucine, K^Lysine, LpLeuclne, 
M«Mcthionine, N=Asparaginc, P^Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W^Tryptophan, 
Y=Tyrosine, X«=Unknown, *=Stop codon, 
A=possible nucleotide deletion, V^ossible 

nucleotide ins ertion • 

iARVVDYLTKLNNGQK'imW 



SS6 



857 



858 



859 



860 



7082 



2207 



2208 



2209 A 



2210 



7088 



7091 



7136 



7156 



396 



320 



23 



1635 



2417 



415 



302 



591 



YFDLPGALL 

KDrVAAIQHNYKMSAFKENOGIYFPEKRDPG 
RYLHSRPESVKKWLRQLKNAGKILLLrrSSHS 
DYCRLLCA\mGNDFTDLFDIVITNAJLKPGFF 
SHLPSQRPFRTLENDEEQEALPSLDKPGWYSQ 
GNAVHLYELLiCKMTGKPEPKWYFGDSMHS 
DIFPARHYSNWETVLILEELRGDEGTRSQRPE 
ESEPLEKKGKYEGPKAKPLNTSSKKWGSFFU 
DSVLGLENTEDSLVYTWSCKRISTYSTIAIPSI 
EAIAELPLDYKFTRFSSSNSKTAGYYPNPPLV 

LSSDETUSK „ 

" SSPSVFEFEHAVQPVFTMEFLKTCVLRRNACr 
AVCFWRSKWQKPSVRRISTTSPRSTVMPAW 
xaDKYGKNEVLRFTQNMMMPIIHYPNEVIVK 
\Wu\SVNPlDVNMRSGYGATALNMKRDPLH 
VKIKGEEFPLTLGRDVSGWMECGLDVKYFK 
PGDEVWAAVPPWKQGTLSEFVWSGNEVSH 
KPKSLTHTQAASLPYVALTAWSAINKVGGLN 
DKNCTGKRVLILGASGGVGTFAIQVMKAWD 
AHVTAVCSQDASELVRKLGADDVIDYKSGSV 
EEQLKSLKPFDFILDNVGGSTETWAPDFLKK 
WSGATYVTLVTPFLLNMDRLGIADGMLQTG 
VTVGSKALKHFWKGVHYRWAFFMASGPCX 
DDIAELVDAGKIRPWEQTFPFSKVPBAFLKV 

ERGHAR GKTVINW 

tLFLLSLLFLVQGAHGR 



LRRRKMTPQSLLQT 

GHREDFRFCSQRNQTHRSaHYKPTPDLRISIE 
NSEEALTVHAPFPAAHPASRSFPDPRGLYHFC 
LYWNRHAGRLHLLYGKRDFLLSDKASSLLCF 
QHQEESLAQGPPLLATSVTSWWSPQNISLPSA 
ASFTFSFHSPPHTGAHNASVDMCELKRDLQL 
LSQFLKHPQKASRRPSAAPASQQLQSLESKLT 
SVRFMGDMOSFEEDRINATVWKLQPTAGLQ 
DLHIHSRQEEEQSEIMEYSYLLPRTLFQRTKG 
RSGEAEKRLLLVDFSSQALFQDKNSSQVLGE 
KVLGIWQNTKVANLTCPWLTFQHQLQPKN 
VTLQCVFWVEDPTLSSPGHWSSAGCETVRRE 
TQTSCFCWHLTYFAVLMVSSYEVDAVHKHY 
LSLLSYVGCWSALACLVTIAAYLCSRVPLPC 
RRia*RDYTIKVH]V!NLLLAVFLLDTSFLLSEPV 
ALTGSEAGCRASAIFLHFSLLTCXSWMGLEG 
YNLYRLWEVFGTYVPGYIXKLSAMGWGFPI 
FLVTLVALVDVDNYGPnLAVHRTPEGVEYPS 
MCWIRDSLVSYITNLGLFSLVFLFNMAMLAT 
MWQILRLRPHTQKWSHVLTaCLSLVLG\LP 
WALIFFSFASGTFQLWLYLFSirrSFQGFUH 
WYWSMRLQARGGPSPLKSNSDSARLPISSGS 

TSSSRI 

DAGAVKSSD'JNIWFRGMCDDKKGHRCPS'U 
QPQHFHVAFHTEAEGAMFYFRLHVIHRVMQS 

OOQLFPSTLF SWLLE 

FFFWRQSLALU^RLECSGATG^^^ 

DCPTSAS*IAGrrGACyHAWLLFVFLAETGFH 

HVGQGGLaLTSSDPSGSASQSAGrrGVSHCT 

WPI 



ALSTETRTPDMRRLLLVTSLVVVIJLWEAGAY 



PAPKVPIKMQVKHWPSEQDPEKAWQARWE 
PPEKDDQLWLFPVQKPKLLTTEBKPRGQGR 
rwPrLPGTKAWMETEDTLGRVLSPEPDHDSLY 
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Amino acid sequence (A^Alaninc C=Cysteine, 
I^Aspartic Acid, &=01utamic Acid, 
F^Phenylalanine, Q=01ycinc, H^Hislidinc, 
I«Isoleucme» KpLysine, LpLcucinCi, 
M=Mctliioninc, N=Asparaginc, P=Pro!ine, 
QKJlutamine, R«TAfginin€; S^Scrine, 
T=Threonine, V=Va!ine, W=Tiyptophan. 
Y=T>TOsine^ X«Unknown, *=Stop codwi, 
possible nucleotide deletion, V=possible 
nucleotide insertion 














HPPPEEDQGEERPRLWVMPNHQVLLGPBliUg 
DHIYHPQ*GSRGHHCPRPVPRPRLLGLGPSLP 
CPS 


861 


2211 


A 


7161 


1220 


1003 


■NYVCriAF*EKKMGPLSLSCLVLLFVLFLL>U 

LTTTTRIMFH(nVIJASVamNm 

KSAMILO 


862 


2212 


A 


7211 


665 


847 


LKYYHlTMGlYKTGKKVIL*KSSMSNRFSVff 
YKNIOKLSFSNYVYHQNYVFSSDWSYDF 


863 


2213 


A 


7212 


924 


1273 


HGSSCALGDLAPG*LPSGPVLSSPAVRL*RKP 
T \AimQpcPT PATGPT*G1 VI>VLGGPDCT*WA 
RGQHEHK3UV1RAP*SCRVTVNLAKKKI^ 
rnrPMvnsPPKECDYNILANSVA 


s6a 


2214 


A 


7214 


845 


1619 


SDKGGKKADRKJmLRHAFPLLPHRVRJBKLH 
DPKVPVDADHVQGQDPGRAAHDIHGEDVTE 
KVSKDPLAPDEVGDTDEGHDRHQHREVGQR 
. HGHDQEEVAYEERACEGGKFATVEVTDKPV 
DEALREAMPKVAKYAGGTNDKGIGMGMTV 
PISFAVFPNEDGSLQKKLKVWFRIPNQFQSDP 
PAPSDKSVKIEEREGrrVYSMQFGGYAKEAD 
v\; An ATDT p A AT FfrTATYRGDIYFCTGYDPP 
MKPYGRRNEIWLLKT 


865 


2215 


A 


7246 


559 


682 


RRLGAVAHAYTSSTLGGRGGWTPGQELgi^ 

LAInMAjSjtKI-* r _ 


866 


2216 


A 


7257 


641 


1310 


TCTYKYLMGWIRGRKSKHSWEMSliFHNYNL 
DLKKSDFSTRWQKQRCPWKSKCRENASPFF 
i?nrFT A V AMnrRFms/TVAIWSAVFLNSLFNQEV 
QIPLTESYCGPCPKNWICYKNNCYQFFDESKN 
WYESQASCMSQNASLLKVYSKEDQDL3LKLV 
KSYHWMGLVHIPTNGSWQWEDGSILSPNLLT 
nEMQKGDCALYASSFKGYIENCSTPNTyiCM 
ORTV 


867 


2217 


A 


7288 


151 


396 


SDCHEAFGSNGPDFWFFKY WSP*LFRQQWF1 
MPFFQTLWLMNANRFCSIFTTTNVANNCWW 
TPYHCWLSVWCRCESHGI 


868 


2218 


A 


7298 


3 


272 


pdtviggrgsggkefgrwvlw*Vfe*rlgtp 

ifr,<!CPAGGSRMVSESD*EGRGC*ASYPCAC* 
AGS*WR*GSRPAGRGTPPRSLSHARPP 


869 


2219 


A 


7332 


1223 


332 . 


■ PRRDAEDRDESCLNPAFPIGLLHPNSVNSMAK 
FLTLCrWlXLLGPCHXATVRAECSQDCATCS 
YRLVRPADINFLACVMECBGKLPSLKIWETC 
KELLQLSKPELPQDGTSURENSKPEESHaA 
KRYGGFMKRYGGFMKKMDELYPMEPEEEA 
NGSEILAKRYGOFMKKDAEEDDSLANSSDLL 
KELLETG0NRERSHHQDGSDNEEEVSKRYGG 
FMRGLKRSPQLKEKAKBLQKRYGGFMRRVG 
PQKW*MtSPQNRYGGFLKRFAEALPSDEEGE 
SYSKEVPEMEKRYGGFMRF 


O Iv 


2220 


A 


7382 


216 


1018 


EIHQRLTERTQFLDESRKNPNS*QANLLRGGU 

AGQGRGREGAESGGSRGEGPGSDGRLPATUU 

FWSPRSQRRGCCGRRAPRPEAMENGAVYSPT 

TEEDPGPARGPRSGLAAYFFMGRLPLLRRVL 

KGLQLLLSLLAHCEEWSQCTLCGGLYFFEF 

VSCSAFLI^IXILIVYCTPFyERVDTTKVKSSD 

FYmX}TGCVFliASllFVSraDRTSAEIAAIVF 

GFIASFNIFLLDnTNILYEKRQESQLRKPENTr 

RAEALTEPLNA 


871 


2221 


A 


"7403 


3 


393 


■ ' SCAMCSGLL*LLLPIWLSWTLX}TRGSEPRSVN 
DPGNMSFVKETVDKLLTOFRCFREREAAPRR 
ALRGAALPGESEAGDPESLRSSVNADWIQYS 
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[ SEQ ID J 
NO: of 1 
nucl- 1 
eotide 

1 scqr 
1 uence 


SEQID 
MO: of 
peptide 
seq- 
uence 


J^et J 
tiod . ] 


SEQ 1 
[DNO: 1 
in 

USSN 
09/496 
914 


Tedictea 

beginning 

nucleotide 

location 

corrcspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


CTCCIICICU ClIU 

nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino add sequence QA^Aianine OCysteine, 
D=Aspartic Acid, EFGlutamic Add, 
F^Phenylalanine, G«Glydne, H=Histidine^ 
l=Isolcucine, K=Lysine, IM^cine, 
M-Methioninc, N-Asparaginc, P^Pioline, 
Q^Glutmnine, RaArgimne^ S^^erine, 
T^Threonine, V=Va!lnc. W«Tiyptophan. 
Y=Tyrosine, X«=Uiilaiown, •»Stqp codoo, 
/=^ossible nucleotide deletion, \=possible 
nucleotide insertion 














DLWEAEVSTFRCEAOFCQECFRTPGNQEKDO 


872 


2Z22 


A 


7413 


1061 


359 


FVDIVSWEFPHCPEARFPAQHGQDSKRLTLC 
PGGS*PQATLHLDRMRVSASFrKEIQVKKYK 
CGLKPCPANYFAFKICSGAANWGPTMCFED 
RMIMSPVKNNVGRGLNIALVNGTTOAVLOQ 
KAFDMYSGDVMHLVKFLKEIPGGALVLVAS 
YDDPGTKMNDESRKLFSDLGSSYAKQLGFRD 
^iwvPTGAKDLRGKSPFEOFLKEQPQTQNKYE 
nWPELLEMEGCMPPKPF 
■ n pAaHfift<;rT ♦SOHFGRLRWEDRLRLGVQ 


873 


2223 


A 


7429 


2242 


2394 


DHPGOHCETPSLLKIERKLF 

prvro/^VT WATf HT J^AS'i'kKAPOAECGMlSlTK 


874 


2224 


A 


7468 


146 


T94 


WQKIGVGITGFGIFFILFGTLLYFDSVLLAFGN 
LLaTGLSLnGLRKTPWFFFQRHKUCGTSFLL 
GGWIVLLRWPLLGMFLETYGFFSLFKGFFPV 
AF0FLGNVCNIPFLGALFRRLQ0TSSN4V*KTE 
MSSLNLDHWLKGAKREEWEPPPQSPALTHSP 
TYPGPPQVQKERNGAEQLTSNPQVDSRGCQE 
AEMOTPRRLGWGWYHaTLYLWEEK 


875 


222S 


A 


7498 


91 


251 


GEKPVPTWLQDEAGQWLLGFVAQPWCiWFCi 

SRRHEP»HGGVLFRLGPSAPPGKL 

vo/^T r«cr giirwrr^PinvrsVHrWLGTVVHAYNPN 


876 


2226 


A 


7544 


403 


587 


YSCLCrLrisjlll or JsiNdVCiiwijvji t ¥tirvi*'»*. 
lLGGOGGWIA*GQEFKTSLGNTVRPCLYK 


877 


2227 


A 


7566 


2 


940 


gcapdtrffvpepggrGaApwvalvarggu 
tfkdkvlvaarrnasawlweerygnttlp . 
mshagtgniwimisypkgrbilelvqkgipv 

TMTIGVGTRHVQEFlSGQSVVFVAIAFrTMMII 

SLAWLIFYYIQRFLYTGSQIGSQSHRKETKKVI 

GQLLLHTVKHGEKGIDVDAENCAVCIENFKV 

KDHRILPCKHIFHRICIDPWLLDHRTCPMCFa 

DVIKALGYWGEPGDVQEMPAPESPPGRDPAA 

NLSLALPDDDGSDESSPPSASPAESEPQCDPSF 

KGDAGEOTALLEAORSDSRHGOPIS 


878 




A 


7586 


315 


1232 
391 


■ ERSLLCKVDVRWIYVSEGTKTQRRHRQG5LR 
RGRMQAACWYVUTXQFIVYLVTCANLTOG 
GKSELLKSGSSKSTLKJfflWTESSKDLSISRLLS 
QTFRGKENDTOLDLRYDTPEPYSEQDLWDW 
LRNSTDLQEPRPRAKRRPIVKTOKFKKMFGW 
GDFHSNIKTVKLNLLITGKIVDHGNGTFSVYF 
RHNSTGQGNVSVSLVPPTKIVEFDLAQQTVID 
AKDSKSFNCWEVEKVDKAmnLOm^PSK 
TCYQEQTQSHVSWLCSKPFKVICrYISFYOT> 
YKLVOKVCPDYNYHSDTPYFPSG 

■ -TRSWKLKWWSPI'CLDQLNOSAPGNVFIHG 


879 
880 


2229 
2230 


A 
A 


7605 
7612 


479 
93 


659 


■ DAAVAMTAQGGLVANRGRRFKWAIELSUKj 
GGSRGRSDRGSGQGDSLYPVGYLDKQVPDTS 
VQETDRILVEKRCWDIALGELKQIPMNLFIMY 
MAGNTISIFPTMMVCMMAWKr iv^AU^a^^ i 
FKMLESSSQKFLQGLVYLIGNLMGLALAVYK 
CQSMGIJJTHASDWLAFIEPPBRMEFSGGGL 


881 


2231 


A 


7615 


291 


1452 


- sPQKTMRSHXrmrraVSSWPYSSHRMRFli 
NHSDQPPQNFSATPNVTTCPMDEKLLSTVLTT 
SYSVIFIVGLVGNIIALYVFLGIHRKRNSIQIYL 
LNVAIADLIXIF(XPFRIMYHINQNKWTLGVIL 
CKyVGTLFYMNMYISIILLGFlSLDRYIKINRSI 
OQRKArnXQSIYVCOVWMLALGGFLTKDIL 
TLKKGGHNSTMCFHYRDKHNAKGEAIFNFIL 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 

©NO: 

in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alaninc C=Cysteine, 
D=»Aspartic Acid, E^lutamic Acid, 
F«Phcnylalaninc, GKjlycinc, H-Histidinc, 
IsTcoleucine K— Lvsine. Ij^LcuchXt 
M=Methionine, N=Asparagine, P=Prolinc, 
QKjlutaminc, R«Argininc, S=»Serine, 
T=Threonine, V=Valine, W^Tiyptophan, 
VsTvrosine. X=Unknown- *— StOD codos. 
A^possible nucleotide deletion, \ppossible 
nucleotide inso^on 














VVMFWLIFLLIXLSYIKIGKNLLMSKRRSia'l'W 

SGKYATTARNSFIYLIIFTICFVPYHAFRFIYISS 

nr NVSSCYWKEIVHKTNEIMLVLSSFNSCLDP 

VMYFLMSSNIRKIMCQLLFRRFQGEPSRSEST 

SEFKPGYSLHDTSVAVKIQSSSKST 

onAyf AT T VA>JKTyi l9AGLKEFSVLLNOOVFND~ 


882 


2232 


A 


7617 


67 


379 


PLVSEEDMVTWEDWMNFYINYYRQQVTGE 
PQERDKALQELRQELNTLANPFLAKYRDFLK 
SHELPSHPPPSS 


883 


2233 


A 


7622 


400 


215 


KVKTCRYNPKYSAANDTOFVDIPSREKDLAK 
AVATVGPISVAVGASHVFFQFYKKGKHLSS 


884 


2234 


A 


7638 


2640 


2861 


APVLILQMVKLSI VLl rv^r LfOHlAiVjVLr i jsju^v 
QHVKSVTCPCEYLRKVSECRQMGPGALEQFP 
GLSCHTSHSG 


885 


2235 


A 


7642 


201 


455 


PSRGKMELEAMSRiTSPVNrAVrl'nL.i V vi^ 
AIGMFFTAWFFVYEVTSTKYTRJDIYKELUSL 
VASLFMGFGVLFLLLWVGIYV 


i%6 


2236 


A 


7692 


61 


"569 


APENPFSRQHFNSETKVKLSLKrOrWLGNHA 

HLGEHFSTHHELGLSGKWGFLVKNILEVIRN 

GGMETRHPOKVSSWFHRWDSRAEQHNHAE 

HHEDVPQGDEDSKVSEAQQEFPDWTCAGLP 

GLLPKALRVLLFQLKVQHRPGIHQQRPEQQD 

VSDHRYGRSVRQNRK 


887 


2237 


A 


7693 


85 


315 


NPGCCLPVAMRTSYLLLFILCLLLSEMASGU 

NFLTGLGHRSDHYNCVSSGOQCLYSACPIFTK 

lOGTCYRGKAKCCK 


888 


2238 


A 


7702 


242 


1298 


APSHRRRYLSPSRSAUQLGNMALbKLUb V 

VLLITVLWEGIAVAQKTQDGQNIGIKHIPAT 

QCGIVAOITSNGGHFASPNYPDSYPPNKECIYI 

LEAAPRQRIELTFDBHYYIEPSFECRFDHLEVR 

DGPFGFSPLIDRYCGVKSPPLIRSTGRFMWIKF 

SSDBELEGLGFRAKYSFIPDPDFTYLGGILNPIP 

DCQFELSGADGIVRSSQVEQEEKTKPGQAVD 

CIWmAIPKAKIYLRFLDYQMEHSNECmiF 

VAVYDGSSSDENLKAKFCSTVANDVMLKTGI 

GVIRMWADEGSRIJ^RFRMLFTSFGGASPAQA 

ALSFCHSNMCINNSLVCNOVQNCAYPWDEN 

HC 


889 


2239 


A 


7707 


185 


2911 


CHYIMNPSTHHPASAGGSILGLFDFFGLOLOJJ 

MTMDALLARLKLLNPDDLREEIVKAGLKCGP 

ITSTTRFIFEKKLAQALLEQGGRLSSFYHHEA 

GVTALSQDPQRILKPAEONPTDQAGFSEDRDF 

GYSVGLNPPEEEAVTSKTCSVPPSDTDTYRAG 

ATASKEPPLYYGVCPVYEDVPARNERIYVYE 

NKKEALQAVKMIKGSRFKAFSTREDAEKFAR 

GICDYFPSPSKTSLPLSPVKTAPLFSNDRLKDO 

LCLSESETVNKERANSYKNPRTQDLTAKLRK 

AVEKGEEDTFSDLIWSNPRYUGSODNPnVQ 

EGCRYNVMHVAAKENQASICQLTLDVLENP 

DFMRLMYPDDDEAMLQKRIRYVVDLYLNTP 

DKMGYDTPLHFACKFGNADWNVLSSHHLI 

VKNSRNKYDKTPEDVICERSKNKSVEILKERIR 

EYLKGHYYVPLLRAEETSSPVIGELWSPDQTA 

EASHVSRYGGSPRDPVLTLRAFAGPLSPAKAE 

DFRKLWKTPPREKAGFLHHVKKSDPERGFER 

VGRELAHELGYPWVEYWEFLGCFVDLSSQE 

GLQRLEEYLTQQEIGKKAQQETGEREASCRD 

KATTSGSNSISVRAFLDEDDMSLEEKNRQNA 

ARNNSPPTVGAFGHmCSAFPLEOEADLIEAA 
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SEQID 5 
NO: of 1 
nucl' I 
eotide i 
seq- 
uence 


SEQID J 
^0: of t 
peptide 
seq- 
uence 


tod I 
i 
1 
< 


JEQ 1 I 
DNO: I 
n I 
JSSN 1 
}9/496 < 
?14 1 

j 
1 


Predicted J 
)eginning i 
lucleotide 1 
ocation 
:orrcspondi 
ag to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end ^ 
lucIeotide 
ocation 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


^no acid sequence (A=Aianine C«Cysteme. 
[>»Aspartic Acid, &=<Thitamic Acid, 
F=Phenylalanine, G^lycine, H^Histidine, 
NIsolcucine, K-Lysine, b=Lcuciiic 
M=Methionine, N^Asparaginc, P=Proline, 
Q=Glutamine, R.°Arginine, S=Serine, 
T=Threonine, V=Valinc, W=Tiyptophan, 
Y=Tyrosine, X^Unknown, ♦^Stop codon, 
/=90ssible nucleotide deletion, V»posstbl6 
nucleotide insertion 












269 


EPGGPHSSRNGLCHPtNHSRTLAGKRPKAJr-K 
GEEAHLPPVSDLTVEFDKLNLQNIGRSVSKTP 
DESTKTKDQE.TSRINAVERDLLEPSPADQLG 
NGHRRTESEMSARIAKMSLSPSSPRHEDQLEV 
TREPARRLFLFOEEPSKLDQDVLAALECADV 
DPHQFPAVHRWKSAVLCYSPSDRQSWPSPAV 
KGRFKSQLPDLSGPHSYSPGRNSVAGSNPAKP 
GLGSPGRYSPVHGSQLRRMARLAELAAL 
"UHMPVIPALWEAEVGGLLKPRSSRSAWArJi 


190 
891 


2240 
2241 


A 
A 


7711 
7721 


360 
61 


1175 


KLPWEPSFLIKMQliKflODV i a /u-ioru-tr y u 

VSQYEKLDAGEQRLMNEAFQPASDLFGPTTL 

HSPSDWrrSHPEAPQDFEQFFSDPYRKTPSPN 

KRSIYIQSIGSLGNTRnSEEYIKWLTGYCKAYF 

YGUIVKLLEPVPVSVTRCSFRVNENTHNLQIH 

AGDILKFLKKKKPEDAFCVVGITMIDLYPRDS 

WNFVFGQASLTDGVGIFSFARYGSDFYSMHY 

KGKVKKLKKTSSSDYSIFDNYYIPEITSVLLLR 

SCKTLTHEIGHIFGLRHCQWLACLMQGSNHL 

EEADRRPLNLCPICLHKLQCAVGFSIVERYKA 

LYRWIDDESSDTPGATPEHSHEDNGNLPKPV 

FAFKEWKEWincaAVLQK 


892 


2242 


A 


7723 


2 


1650 


" SAPTAPARPCRAERGSGG0MLALLAA5VALA 
VAAGAQDSPAPGSRPVCTALPPEAVHAGCPL 
PAMPMQGGAQSPEEELRAAVLQLRETWQQ 
KETTLASARAIRELTGKLARCEGLAGGKARGA 
GATGKDTMGDLPRDPGHWEQLSRSLQTLK 
DRLESLEPLPAMPMQGOAQSPEEELRAAVLQ 

T T»r}fnn/r\OVim A Q AR ATRFT TOTCLARCEGL 

AGGKARGAGATGKDTMGDLPRDPGHWEQ 
LSRSLQTLKDRLESLEHQLRANVSNAGLPCff) 
FREVLQQRLGELERQLLRKGAELEDEKSLLH 
NETSAHRQKTESTLNALLQRVTELERGNSAF 
KSPNAFKVSLPLRTNYLYGEaKKlLPELYAFT 
ICLWLRSSASPGMOTPFSYAVPGQANEIVLIE 
WGNNPIELLINDKVAQLPLFVSDGKWHHICV 
TWTTRIMSMWEAFQDGKJCLGTGENLAPWHPI 
KPGGVLILGQEQDTVGGRFDATQAFVGELSQ 
FNIWDRVliUQEIVWANCSTNMPGNIIPWVD 
KNVDVFGGASKWPVETCEERIXDL 
' TTArn^AMmT>LTI_EMDLENT-l^l^^'PWELPRL 


893 


2243 


A 


7729 


3554 


2419 


DNYNDTSLVENHLCPATEGPLMASFKAVFVP 
VAYSLIFLLGVIGNVLVLVILERHRQmSSTET 
FLFHLAVADLLLVFILPFAVAEGSVGWVLGTF 
LCKTVIALHKVNFYCSSLLLACIAVDRYLAIV 
HAVHAYRHRRLLSlHTTCGnWLVGFLLALPEI 
LFAKVSQGHHNNSLPRCTFSQENQAETHAWF 
TSRFLYHVAGFIXPMLVMGWCYVGWHRLR 
OAORRPQRQKAVRVAILVTSIFFLCWSPYHIV 
tTTT T^Ti ADi V AvmsimcLNGSLPVAITMCEFL 
GLAHCCLNPMLYTFAOVKFRSDLSRLLTKLQ 
rrOPASLCOLFPSWRRSSLSESENATSLTTF 


894 


2244 


A 


7738 


670 


287 


" FVTRAGRWGAGARVRGGAGGMASOAAKWi- 
VLAPVRSGALRSGPSLRKDGDVSAAWSGSGR 
SLWSRSVrVTRSGABLPKPVKMSFOLLRVFSI 
VIPFLYVGTLISKNFAALLEEHDIFVPEDDDDD 


895 
896 


2245 
2246 


A 

A 


7753 
' 7754 


119 
1 


278 
372 


" ■ APYAHSQVHCLDKVCGLLPFLNPEVFDgMK 

1 V/l Sf ^ HAfiCTAPHCPRTRPL 
- spAWWh^SC^RVVSPH^LTLEFTFHHLLr^^ 
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SEQ ED 
NO: of ^ 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


N/Iet 
tiod 


SEQ 1 
[DNO: 1 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ngtofirst 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Aiaiune C=Cysteme. . 
l>=Aspartic Acid, Ep=Glutamic Acid, 
F=^eaylalanine, G=<jlycinc, H«Histidine, 
I«^Isolcudne, K^Lysmc, L=l.euciDe, 
M=Methionine, N=Asparagine. P^Proline. 
Q=Glutamine, R«Argininc, S=Serine, 
T^Threonine, V=Valine, W=Tryptophan, 
y=Tyrosine, X=Unknown, *=Stop codon, 
>=possible nucleotide deletion, V=possible 
nucleotide insertion 














QVSTAALAVLLCTI^CNQVLSAPLAAU I if I 
ACCFSYTSRQIPQNFIADYFETSSQCSKPSVIFL 
TKRGROVCA0PSEEWVQKYVSDLELSA 


897 


2247 


A 


7761 


1725 


445 


"RPRRRGTHHFSCVLOSFRVSAMFPRVSTFLPL 
mSRHPLSSGSPETSAAAIMLLTVRHGTVRY 

RSSALLARTKNNIQR I rU i No v iVyiKPa^^v«3 * 

RTEETSKETSESQDSEIOENTKKDLLGIUCGMK 

VEI^TVNVRTTKPPKRRPLKSLEATLGRLRRA 

TEYAPKKRIEPLSPELVAAASAVADSLPFDKQ 

TTKSELLSQLQQHEEESRAQRDAKRPKISFSNl 

ISDMKVARSATARVKoi<xJbiJU vrx-'Du i ui^ i r 

GQEKTDDLKKRKNIFTGKRLNIFDMMAV^ 

apetdtspslwdvefakqlatvneqplqngf 
eeliqwtkegklwefpinneagfdddgsefh 
ehjflekhlesfpkqgpirhfmelvtcglsknp 
ylsvkqkvehiewfrnyfnekkdilkesniqf 

klrpwkflfrnn 


898 


2248 


A 


7775 * 


85 


496 


SCQTTQPPAQSCSTGTIVLRIMLLFTAILAhbl-A 
QSFGAVCKEPQEEWPGGGRSKRDPDLYQLL 
QRLFKSHSSLEGLLKALSQASTDPKESTSPEK 
RDMHDFFVGLMGKRSVQPDSPTDVNQENVP 
SFGILKYPPRAE 


899 


2249 


A 


7785 


179 


703 


"PFHLGASSNTFRLQVQTQESKAQKEVKMGFI 
FSKSMNESMKNQKEFMLMNARLQLERQLIM 
QSEMRERQMAMQIAWSREFLKYFGTFFGLA 
AISLTAGAIKKKKPAFLVPIVPLSFILTYQYDL 
GYGTLLERMKGEAEDILETEKSKLQLPRGMrr 
FESEEKARKEQSRFFIDK 






A 


7789 


1465 


300 


VWLPLKSYKIRSPSLHCQCEIFREEFLFSSH^I: 

GRDKDTFSKMAMVSEFLK.QAWribWiiii'^ii i 

VQTVKSSKGGPGSAVSPYPTFNPSSDVAALH 

KAIMVKGVDEATIIDILTKRNNAQRQQIKAAY 

LQETGKPLDETLKKALTGHI^BVVIjUXKTP 

AQFDADELRAAMKOLGTDEDTLIEILASRTN 

KEIRDINRVYREBLKRDLAKDITSDTSGDFRN 

AaSLAKGDRSEDFGVNEDLADSDARALYEA 

GERRKGTOVNVFNTILITRSYPQLRRVFQKY 

TKYSKHDMNKVLDLELKGDIEKCLTAIVKCA 

T'evnArTJAcin MOAMKGVGTRHKALIRIMVS 

rshdmndikafyqkmygislcqaildetkgd 
yekilvalcggn 


901 


2251 


A 


7796 


2 


807 


" 'VEFHPQRARAGARAPSMG VLLTQRTJLLSI-Vl. 

allfpsmasmaaigscskeyrvllgqlqkqt 

DLMQDTSRLLDPYIRIQGLDVPKLREHCRERP 

gafpseetlrglgrrcflqtlnatlgcvlhrl 
adleqrlpkaqdlersglniedleklqmarp 
nilglknniycmaqlldnsdtaeptkagrga 
sopftftpasdafqrklegcrflhgyhrfmh 

SVGRVFSKWGESPNRSRRHSPHQALRno V KK 
TRPSRKGKRLMTRGQLPR 


902 


2232 


A 


7802 


2 


721 


- 'tAARRRQKGTAARRLQKGTAARRRQI^^UI AA 
RRRQKGTAARRPQKGTAARRRQKGTAARRR 

okgtaarrrqkgtaarrpqkgtaarrrqkg 
taarrrqkgtaarrrqkglaiasrgcpcasr 
aggvrgagsrlramapkvfrqywdipdgtd 
chrkaysttsiasvagltaaayrvtlnfpgtf 
legvakvgqytftaaavgavfglttcisahv 
rekpddplnyflggcaggltlgarthnygig 
aaacvyfgiaaslvkmgrlegwevfakpkv 
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PCTAJSOl/03800 



SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng 10 Uibi 

amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino add sequence (A^Alanine OCysteine, 
]>Aspaitic Acid, &=Gtutamic Acid, 
F'^Phenylalanine, 0«Glycme, H==Histidine, 
I=Isoleucinc, K^Lysinci, IM^cine, 
M=Methioninc N=Asparagine, P=Proline, 
QKilutaminCi R=Aigiiiine, SpSerinc, 
T=Thrconine, V=Valine, W=Tiyptophan, 
Y'Tyrosine, X«=^nknown, ♦=Stop codon, 
/=possible nucleotide deletion, V=possible 
nucleotide insertion 


903 


2253 


A 


7807 




584 


PWU*WSIXjRAARSSRKCPRSRFPVQVGKMA 

VSTWSTSSUiLAl^imSIXSPlJLSVTSFRRrY 

RGDSPTDSgroMIEIPLPPWQERTDESIBrKR 

AW-LYESRKRGMLENCiLLSLFAKEHLQHMT 

EKQLNLYDRLINEPSNDWDIYYWAIEAKPAP 

EIFE1^VMA1IJ®FAKNKNKEQR^^ 

LFEKPR . , 


904 


2254 


A 


7813 


40 


821 


"GAGRALGHLlBtGAGDVAAALPAbKlFPRSLLG 
AGARLTGWTMNVFRILODLSHLLAMILLLGK 
IWRSKCCKGlSGKSQnJ'ALVFTOYLDLFrNF 
T<?TVTsrrvMKVVFllX}AYVTVYMIYOKFRKTF 
DSENDTFRLEFLLVPVIGLSFLENYSFTLLEIL 
WTFSIYLESVAILPQLFMISKTGEAKlTriHYL 
FFLGLYRALYLANWIRRyQTENFYDQIAWS 
GWQnFYCDFFYLYVTKGRSWDDSNADTGL 

RSYSSI 


905 


2255 


A 


7817 


1399 


881 


■ LSNKDVLSPQLKDENSKLRRKLNEVQSFSEA 
QtEMVRTl£RKLEAKMIKEESDYHDLESVVQ 
QVEQNLELMTXRAVKAENHVVKLKQEISLL 
QAQVSNFQRENEALRCGQGASLTWKQNAD 
VALQNLRWMNSAQASIEQLVSGAETLNLVA 
EILKSIDRISEVKDEEEDS 


906 


2256 


A 


7822 


3 


1462 


DSPRNRFEILGRPTRTPTRPGFRPAMEDLDAL 

LSDLETTTSHMPRSGAPKERPAEPLTPPPSYG 

HQPQTGSGESSGASGDKDHLYSTVCKPRSPK 

PAAPAAPPFSSSSGVLGTGLCELDRLLQELNA 

TmajrmFTVi^OFPSSKVASGEOKEDOSEDKK 

RPSLPSSPSPQLPKASATSATLELDRLMASLSD 

TOVO>mLPASGPT0PPWSSTNEGSPSPPEPTG 

KGSLDTMLGLLQSDLSRRGVPTQAKGLCGSC 

NKPIAGQWTALGRAWHPEHFVCGGCSTAL 

GGSSFFEKDGAPFCPECYFERFSPRCGFCNQPI 

RHKNfVTALGraWHPEHFCCVSCGEPFGDEG 

FHEREGRPYCRRDFLQLFAPRCXJGCQGPILDN 

YISALSALWHPDCFVCRECFAPFSGGSFFEHE 

GRPLCENHFHARRGSLCATCGLPVTGRCVSA 

LGRRFHPDHFTCTFCLRPLTKGSFQERAGKPY 

COPCFLKLFG 


907 


2257 


A 


7828 


1792 


1671 


■ FIYVNQSFAPSPDQEVGIXYECFGSDGKLVLH 
YCKSOAWG 


908 


2258 


A 


7842 


no 


1 1T7 
11 /X 

• 


■ "klscpcshgtrvtavrgprlkagyqwhulu 

SLQPPPSGLKQSSHLSLSSSWDFRHAPTHPET 

YTCPKMDBMEQAEAQLAELDLLASMFPGENE 

LIVNDQLAVAELKDCIEKKTMEGRSSKVYFn 

NMNLDVSDEKMAMFSIACILPFKYPAVLPEI 

TVRSVlXSRSQ(?rQLITrDLTAFLQKHCHGDV 

CILNAIEWVREHASGYVSRDTSSSFITGSTVQ 

SVDLIFIRLmSHHnrNKCKRKNEXWAKEL 

KLNWRILIRHREDIPFDGTNDErERQRKFSIF 
EEKVFSVNOARGNHMDFGQLYQFLNTKGCG 
DVFOMFLWV 


909 


2259 


A 


7870 


3067 


2923 


■ iBGICVYTFrmiMYTR'rCMHTYPYMYMNS V 
LISSEILLIPSKYLFESK 


910 


2260 


A 


7884 


212 


4874 


■ GALTWSHPLLAVCPQGVWLGSTTSGSPALU' 
PSHRVNAEPGCWTNACASGPCPPHANCRDL 
WQTFSCTCQPGYYGPGCVDACLLNPCQNQG 
SCRHLPGAPHGYTCDCVGGYFGHHCEHRMD 
QQCPRGWWGSPTCGPCNCDVHKGFDPNCNK 
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wo 01/57188 



PCTAJSOl/03800 



SEQIU : 
NO: of 1 
nucl- 1 
eotide 
seq- 
uence 


SEQID 1 
M : of 1 
)eptide 
seq- 
uence 


V[et ! 
lod ] 


3EQ ] 
[DNO: 1 
m ] 
USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspond! 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino • 
acid residue 
of peptide 
sequence 


Amino acid sequence (A^Aianine (^Cysteine, 
D='Aspartic Add, B»Glutamic Aci4 
F=Ph«iylaIanine, G=01ycine» H«Histidine> 
l=Isoleucinc, K»Lyslne, LpLeucinc, 
M^Methionine, N^Asparagine. P=Pioline, 
Q=G!utamine, R«Ariginine, S=Serine, . 
T=Threonine, V=Valine, W-Tiyptophan, 
Y^Tyrosine, X-Unknown, ♦=Stop codon, 
/>=^ssible nucleotide deletion, \»possibte 
nucleotide insertion 














TNGQCHCKEFHYRPRGSDSCXPUDCYPVGST 

SRSCAPHSGQCPCRPGALGRQCNSCDSPFAEV 

TASGCRVLYDACPKSLRSGVWWPQTKFGVL 

ATVPCPRGALGLRGAGAAVRLCDEAQGWLE 

PDLFNCTSPAFREI^LLLDGLELNKTALimiE 

AKKLAQRLREVTGHTDHYFSQDVRVTARLL 

AHLLAFESHQQGFGLTATQDAHFNENLLWA 

GSALLAPETGDLWAALGQRAPGGSPGSAGLV 

RHLEEYAATLARNMELTYLNPMGLVTPNIML 

SIDRMEHPSSPRGARRYPRYHSNDFRGQDAW 

DPHTHVLLPSQSPRPSPSEVLPTSSSIENSTTSS 

VVPPPAPPEPEPGISinLLVYRTLGGLLPAQFQ 

AERRGARLPQNPVMNSPWSVAYFHGRNFLR 

GILESPISLEFRLLQTANRSKAICVQWDPPGLA 

EQHGVWTARDCELVHRNGSHARCRCSRTGT . 

FGVLMDASPRERLBGDLELLAVFIHVVVAVS 

VAALVLTAAILLSLRSLKSNVRGIHANVAAA 

LGVAELLFLLGmTHNQLVCTAVVILLHYFF 

LSTFAWLFVQGLHLYRMQVEPRNVDRGAMR 

FYHALGWGVPAVULGLAVGLDPEGYGNPDF 

CWISVHEPLIWSFAGPWLVIVMNGTMFLLA 

ARTSCSTCQREAKKTSAL1LRSSFLLLLLVSA 

SWLFGLLAVNHSEj\FHYLHAGLCGLQGLAV 

LLLFCVLNADARAAWMPACLGRKAAPEEAR 

PAPGLGPGAYNNTALFEESGLnOTLGASTVSS 

VSSARSGRTQDQDSQRGRSYLRDNVLVRHGS 

AADHTDHSLQAHAGPTDLDVAMFHRDAGA 

DSDSDSDLSLEEERSLSIPSSESEDNGRTRGRF 

QRPLCRAAQSERLLTHPKDVDGNDLLSYWPA 

LGECEAAPCALQTWGSERRLGLDTSKDAAN 

NNQPDPALTSGDETSLGRAQRQRKGILKNRL 

QYPLVPQTRGAPELSWCRAATLGHRAVPAAS 

YGRIYAGGGTGSLSQPASRYSSREQLDLLLRR 

ni CDirQT PPAPAPVLRPLSRPGSOECMDAAPG 

RLEPKDRGSTLPRRQPPRDYPGAMAGRFGSR 

DALDLGAPRBWLSTLPPPRRTRDLDPQPPPLP 

lspqrqlsrdpllpsrpldslsrssnsreqldq 
vpsrhpsrealgplpqllraredsvsgpshgp 
steqldilssilasfnssalssvqssstplgpht 

TATPSATASVLGPSTPRSATSHSrSELSPDSEPR 

dtqallsatqamdlrrrdyhmerpllnqeh 

leelgrwgsaprthqwrtwlqcsrarayal 

llqhlpvlvwlprypvrdwllgdllsglsva 

imqlpqglayallaglppvfglyssfypvfiy 

flfgtsrhisveslcvpgpvdt 


911 


2261 


A 


7890 


21 


806 


" EiFQTSRSSRSMAEDLGLSFGETASVEMLl'littu 

scrpkarsssarwaltcclvllpflaglttyl 

lvsqlraqgeacvqfqaixgqefapshqqv 

yaplradgdkprahltwrqtptqhfknqfp 

ALHwEHELGLAl* 1 KJn RMN i i iNJvrt»-Uriniovji^ 
YFIYSQVTFRGMTSECSEIRQAGRPNKPDSrrV 
VmCVTDSYPEPTQLLMGTKSVCEVGSNWFQ 
praGAMFSLQEGDKLMVNVSDISLVDYTKE 

DKTFFGAFLL 


912 


2262 


A 


7891 


■-i2eJ3 


111 


" ACGIRHEGALPGUATPEAMLmPDLA^Sl^L 
LILALGQAVQFQEYVFLQFLGLDKAPSPQKFQ 
PVPYILKKIFQDREAAAmVSRDLCYVKELG 
VRGNVLRFLPDQGFFLYPKKISQASSCLQKLL 
YFNLSAKEREQLTLAQLGLDLGPNSYYNLGP 
ELELALFLVOEPHVWGQfrTPKPGKMFVLRSV 
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PCTAJSOl/03800 



SEQID J 
NO: of T 
nucl- 1 
eodde 2 
seq- 1 

U6DCC 


JEQID I 
<0 : of I 
)eptide 
seq-. 
iience 


lod I 
i 
1 
( 


;eq I 

DKO: I 
n I 
JSSN 1 
)9/496 i 


>iedicted 1 
)eginnmg 1 
nucleotide 1 
ocation 
;:onespondi 

amino acid 
residue of 
peptide 
sequence 


>redictedeDd i 
lucleotide 
ocation 
coiresponding 
to last amino 
acid residue 
of peptide 
sequence 


inino acid sequence (A=Alanlne u=<jysieine, 
>Aspartic Acid, Er=Glutamic Acid, 
7=Phenylalanine, G=Glycine, H=Histidine, 
'^Isolcucine, K-Lysine, L«Leucine, 
M=Mcthionine, N«Asparaginc, P==Proline, 
Q=Glutainine> R^Arginine, S?=Serinc, 
T^Threonine. V=VftIinc, W^Tiyptophan. 
Y=Tyiosine, X«=Unknown, *=Stop codon, 
/=poss!ble nucleotide deletion, \=possibIe 
nucleotide insertion 














PV^AVHFNLLDVAKJJWNDNPRKNFGLl^i- 

EILVKEDRDSGVNFQPEDTCARLRCSLHASLL 

vmNPDQCHPSRKIUU^VPKLSCKNLCH 

MOUWRDWWHKWnAPKGFMANYCHGE 

CPFSLTISLNSSNYAFMQALMHAVDPEIPQAV 

aPTKLSPISMLYQDNNDNVIUarra)MV^ 

KCGCG 


913 


2263 


A 


7892 


15 


849 


ASRLPRGPOCUADMRPLUiUXVFAGCrhAL 
YLLSTRLPRGRRLGSTEEAGGRSLWFPSDLAE 
LRELSEVLREYRKEHQAYVFLLFCGAYLYKQ 
GFAIPGSSFLKVLAGALFGPWLGLLLCCVLTS 
VGATCCYLLSSIFGKQLWSYFPDKVALLQR 
KVEENRNSLFFFlXFU«ia>MTPNWFLNLS^I 
LNIPIVQFFFSVLIGLIPYNHCVQTGSILSTLTS 
LDALFSWDTWKLLAIANTVALIPGTIJKJKFSQ 
.pi.NETSTANHIHSRKDT 


914 


2264 


A 


7893 


815 


959 


KSGWVWWLTPUPALV/EAyrEOSLRPEVRM 
RLSNTTRPFFSKKKKILV 


915 


2265 


A 


7909 


3 


641 


HASGPGGLLRRRRGSGANMPVARSWVUKRT 
YVTPRRPFEKSRLDQELKUGEYGLRNKREV 
WRVKFTIAKIRKAARELLTLDEKDPRRLFEG 
NALLRRLVRIGVLDEGKMICLDYILGLKIEDFL 

errlqtqvfklglaksihhahvliqqchirvr 
eqvvnilfftvrldsqkhidfslcfpigvanps 
hvkrknaskgoggagarddeeee 


916 


2266 


A 


7914 


3 


967 


vahtqwhtcqrlsqlthrsilkyllidthac 

QVLIUanHASI^LPSCQECFPSSIPSASHMVS 

hphpppsprwgqtpeglpaaspcgpoprscfs 
silftgdswgmlaclctvlwhlpavpalnrt 

GDPGPOPSIQKTYDLTRYLEHQLRSLAGTYLN 

ylgppfnepdfnpprlgaetlpratvdlevw 

rslndklrltqnyeayshllcylrglnrqaa 

taelrrslahfctslqgllgsiagvmaalgy 

PLPQPLPGTEPTWTPGPAHSDFLQKMDDFWL 
UCELQTWLWRSAKDFNlOjCKKMQPPAAAW 

LHLGAHGF 


917 


2267 


A 


7921 


2 


1166 


' "RPRRGQGLVQEVQTENVl'VAEGGVAmiUKL 
HOYDGSIWIQNPARQTLFFNGTRALKDERFQ 
LEBFSPRRVRIRLSDARLEDEGGYFCQLYTED 
THHQIATLTVLVAPENPWEVREQAVEGGEV 
ELSCLVPRSRPAATLRWYRDRKELKGVSSSQ 
ENGKVWSVASTVRFRVDRKDDGGfflCEAQN 
OALPSGHSKQTQYVLDVQYSPTARIHASQAV 
VREGDTLVLTCAVTGNPRPNQIRWNRGNESL 
PERAEAVGETLTLPGLVSADNGTYTCEASNK 
HGHARALYVLWYGESRLRPTEGGGGAPDP 
GAWEAQTSVPYAIVGGILALLVFUICVLVG 
MVWCSVRQKGSYLTHEASGLDEQGEAREAF 

1 NGSDGHKRKEEin 


918 


2268 


A 


7938 


3 


2653 


- ■krrlppasppsssvssslspsawmackwsijs. 

ESPRWRSALLLLFLAGVYGNGALAEHSENVH 

isgvstacgetpeqirapsghtspgwpseypak 

INCSWFIRANPGEimSFQDFDIQGSRRCNLD 
Virt*TIETYKNnSSYRAa3STIPPPyiSSQDHIWIR 

fhsdbnisrkgfrlayfsgkseepncacdqfr 
cgngkcipeawkcnnmdecgdrsdeeicake 
anpftaaafqpcaynqfqclsrftkvytcip 

ESLKCIX5NIDCLD1X}DE1DCDVPTCGQWL1CY 
FYGlTOSFNYroFYPPGSNCTWUDTOPHRK 
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PCTAJSOl/03800 



Amino acid sequence (A=Alanine 0=Cysteine, 
D=»Aspartic Add, E^lutamic Acid, 
F=PhcnylalBninc, G=Glycine, H=«Histidine, 
I=Isolaicinc, K=Lysinc, L=Lcucinc, 
M=^ethionin^ N=Asparagine, P=Proline, 
(^jJutamine. R=Arginine, S=Serine, 
T=Threoninc, V=Valine, W=Tryptophan, 
Y^^ryrosine, X«Unknown, *«Stop codon, 
A=possible nucleotide deletion, possible 
nucleotide insertion 



SEQID 
NO: of 
nucl- 
eotide 
seq- 
uence 



SEQID 
NO: of 
peptide 
seq- 
uence 



Met 
hod 



SEQ 
ID NO: 
in 

USSN 
09/496 
914 



Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 



Predicted end 
nucleotide 
location 
corresponding 
to last amino 
add residue 
f peptide 
sequence 



919 



2269 



920 



2270 



2271 A 



7951 



7953 



922. 



2272 



923 



2273 



7957 



7967 



7981 



m^TOTKLDGTGYGDYVKIYDGLEENPHK 
LLRVLTAFDSHAPLTWSSSGQIRVHFCADKV 
NAARGFNATYQVDGFCLPWEIPCGGNWGCY 
TEQQRCDGYWHCPNGRDKmcrMCXJKEEFP 
CSRNGVCYPRSDRCNYQNHCPNOSDEKNCFF 
CQPGNFHCKNNRCVFESWVCDSQDDCGDGS 
DEENCPVIVPTTlVrrAAVIGSUCXjLLLVIALG 
CTCKLYSLRMFERRSFETQLSRVEAELLRREA 
PPSYGQUAQGLIPPVEDFPVCSPNQASVLENL 
RLAVRSQLOTTSVRLPMAGRSSNIWNRIFNFA 
RSRHSGSLALVSADGDEVVPSQSTSREPEKNH 
THRSLFSVESDDTDTENERRDMAGASGGVAA 
PLPQKVPPTTAVEATVGACASSSTQSTRGGH 
ADNGRDVTSVEPPSVSPARHQLTSALSRMTQ 
GLRWVRFTLGRSSSLSQNQSPLRQLDNGVSG 
REDDDDVEMLIPISDGSSDFDVNDCSRPLLDL 
ASDQGQGLRQPYNATNPGVRPSNRDGPCERC 
GIVHTAQIPDTCLBVTLKNETSDDEALLLC 



1674 



1839 



WRVTCCPPARS'l'lERTNAYDEEDCVliMVAS 
fy,WNroVACHTTMYFMCEFDKKNM 



47 



572 



GGRASWPEQAKEPRREOHTDKQQTEDVLAA 



GLRCLPHLPAICARRMSPAFRAMDVEPRAKG 
VLLEPFVHQVGGHSCVLRFNETTLCKPLVPRE 

hqfyetlpaemrkftpqykgksqlleglphw 
rgdvrdrghgrpwqpslepslpptlcfpslss 
fssswpsaqhltpsvfnpw 



612 



812 



rsgrtvvtgigyskalq^snrntksllqnef 
mmvysfralsfkestwatfqhggeatksrsl 

SSTQ 



1443 



1660 



enitekwkeiwmcrgnkksccwtfikdrhli 
vsccksksgetllicifcsnlvgffffgirgfsn 

WELVKPN 



3023 



gsapraatamararpppppspppgllpllppll 
ixpluxpagcraleetlmdtkwvtselawt 
shpesgweevsgydeamnpirtyqvcnvres 
sqnnwlrtofiwrrdvqrvyvelkftvrdc 
nsipnipgscketfnlfyyeadsdvasasspfw 
menpyvkvdtiapdesfsrldagrvntkvrs 
fgplskagfylafqdqgacmslisvrafykk 
casttagfalfpetltgaeptslviapgtcipn 
avevsvplklycngdgewmvpvgactcatg 
hepaakesqcrpcppgsykakqgegpclpcpp 
nsrtrspaasictchnnfyradsdsadsactt 
vpspprgvisnvnetsle.ewseprdlgvrdd 

aYNVICKKCHGAGGASACSRCX>DNVEFVPR 

qlglseprvhtshllahtrytfevqavngvs 

gksplppryaavnntnqaapsevptlrlhss 

sgssltlswapperpngvudyemkyfekseg 

lastvtsqmnsvqldglrpdarywqvrart 

vaoygqysrpaefettsergsoaqqlqeqlp 

livgsataglvfwawviaivclrkqrhgs 

dseyteklqqyiapgmkvyidpftyedpnea 

vrefakeidvscvkieevigagefgevcrgrl 

kqpgrrevfvaktlkvgyterqrrdflsea 

simgqfdhpniirlegvvtksrpvmiltefme 

ncaldsflrlndgqftviqlvgmlrgiaagm 

kylsemnyvhrdlaarnilvnsnlvckvsdf 

glsrfleddpsdptytsslggkipirwtapeai 

avrkftsasdvwsygivmwevmsygerpy 
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PCT/USOl/03800 



SEQID J 
NO: of 1 
nucl- 1 
eotide 
seq- 
uence 


SEQID 1 
^0: of 1 
peptide 
seq- 
uence 


Viet 1 S 
lod 1 
i 
1 

1 
1 


5EQ 1 
DNO: 1 
n 1 
JSSN 

?14 


Predicted ] 
beginning 
lucleotide 
ocation 

mrrfOTiondi 

ng to first 
amino acid 
residue of 
peptide 
sequence 


'redicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


/Vminoacid sequence (A==Alamne OCystcme, 
CNAspartic Acid, e=Glutamic Acid, 
F^Phenylalanine, G=Glycine, H=Histidinc, 
hlsoleucme, K=Lysine, Lp=Lcucine, 
M=Metbionine. N=Asparagine, P=Proline. 
Q=Glutamine» R^Arginine, S=Serine, 
T«Threoninc, V«Valine, W^Tryptophan, 
Ys^ryrosine, X=<Jnknown, *=Stop codon, 
Aspossible nucleotide deletion, \F=possible 
nucleotide insertion 














WDMSNQDVINAVEQDYRLPPPMDCPTALHQ 
LMLDCW\ra)IOCRPKFSQIVNTLDKLIRNAA 
SUCYIASAQSGMSQPULDRTVPDYru 1 1 VGD 
WLDAIKMGRYKESFVSAGFASFDLVAQMTA 
EDLLRIGVTIAGHQKKILSSIQDMRLQMNQT 

LPVQV 


924 


2274 


A 


7985 


1 


•503 


FRPRTKKATAMYLEHYLDSIHNLPCELQRNi' 

QLMRELDQRTEDKKAEIDILAAEyiSTVKTLS 

PDQRVERLQKIQNAYSKCKEYSDDKVQLAM 

QTYEMVDKHIRRLDADLARFEADLKDKMEO 

SDFESSGGRGLKKGROQKEKRGSRGRGRRTS 

EEDTPKKKKHKGG 


"925 


2275 ■ 


A 


7994 


447 


589 


LPCSFCAQCMSSFERVWLQQSHFHNPRWNSK 
SPIRCYCOHWPHCVHC 


926 


2276 


A 


7996 


925 


582 


GPOCVCCITLAIMLQCHSFYRKDVQVEHPKS 
LNPKYSQIENFLSADMALKRKCLLSISDLDFW 
IWDAQPVGIMQTLQNLKKIPNPGCFWSQAFQI 
ppTHPn PT^GGRYYTTIRO 


927 ! 


2277 


A 


7998 


2 


353 


RIQM>U^SRSP^mSlJFVKAHL1'AKQAT^^ 
CLIXVTLALCCYQANAEFCPALVSELLDFFFI 
SEPLFKLSLAKFDAPPEAVAAKLGVKRCTDQ 
MST.OKRSLIAEVLVKILKKCSV 


928 


2278 


A 


8004 


130 


588 


LAPLRCQPGTRTQPRSHPAANDPSAAM2SAAU 
ARGUO\TYHRLLDKVELMLPEKLRPLYNHPA 
GPRTVFFWAPIMKWGLVCAGLADMARPAEK 
LSTAQSAVLMATGHWSRYSLVIIPKNWSLFA 
VNFFVGAAGASQLFRIWRYNOELKAKAHK 


929 


2279 


A 


1 8007 


2 


1016 


EFARRRVFIAAREMSLLRSLRVhL V AK 1 GSYP 

AGSIXRQSPQPRHTFYAGPRLSASASSKELLM 

KLRRKTGYSFVNCKKAUBTCGGDLKQAEIWL 

HKEAQKEGWSBCAAKLQGRKTKEGLIGLLQE 

ONTTVLVEVNCETDFVSRNLKFQLLVQQVAL 

GTMMHCQTLKDQPSAYSKGFLNSSELSGLPA 

GPDREGSUOJQLAIAIGKLGENMILKRAAWV 

KVPSGFYVGSYVHGAMQSPSLHKLVLGKYG 

ALVICETSBQKTNLEDVGRRLGQHWGMAPL 

SVGSLDDEPGGEAKTKMLSQPYLLDPSITLGQ 

YVOP0GVSWDFVRFECX3EGEEAAETE 


930 


2280 


A 


8008 


3 


1679 


■ NSRVWGPWTEPSAGSLRPMARKQNRNSKEL 
GLVPLTDDTSHAGPPGPGRALLECDHLRSGV 
PGGRRRKDWSCSLLVASLAGAFGSSFLYGYN 
LSWNAPTPYKAFYNESWERRHGRPIDPDTL 
TLLWSVTVSIFAIGGLVGTLIVKMIGKVLGRK 
HTLLANNGFAISAALLMACSLQAGAFEMLIV 
GRFIMGIDGOVALSVLPMYLSEISPKEIRGSLG 

qvtaificigvftgqllglpellgkestwpylf 
gviwpawqllslpflpdsprylllekhnea 
ravkafqtflgkadvsqeveevlaesrvqrs 

TRLVSVI^LRAPYVRWQVVTVIVTMACYQL 

CGU4A1WFYTNSIFGKAGIPPAKIPYVTLSTGG 

IBTLAAVFSGLVIEHLGRRPLUGGFGLMGLFF 

GTLTTTLTLQDHAPWVPYLSIVGILAIIASFCSG 

PGGIPFILTGEFFQQSQRPAAFnAGTVNWLSN 

FAVGLLFPRQKSLDTYCFLVFATICrrGAIYL , 

YFVU13rrIa^TYAHSQAFSKK^^ 

DSAVTDGKINGRP 


931 


2281 


A 


8009 


861 


300 


" AAGAWSAMPKAKGKLTRKQKFGYSVNKlua. 
NRNARRKAAPRIECSHIRHAWDHAKSVRQNL 
ABMGLAVDPNRAVPLRKRKVKAMEVDIEER 
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SEQID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 

USSN 
09/496 
914 


Predicted 
beginning 

location 
correspondi 
ng to first 
amino acid 

rcbiuuc \jx 

peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine OCysteme, 
D=Aspartic Acid, B=Glutamic Acid, 
F=Phenylalanine, GKJlycine, H«Histidine, 
I«lsolcucinc, K=Lysine, L=Lcucine, 
M=Methionine.N=Asparagine, P=Proline, 
Q=Glutamine, R«Arginine, S^Serine, 
T^Threoninc, V^Valine, W=Tryptophan, 
Y=^yrosine, X=Unknown, ♦^Stop codon, 
/=possible nucleotide deletion, V=possible 
nucleotide insertion 














■pKELVRKPYVLNDLEAEASLPEKKGNTLSRD 
LIDYVRYMVENHGEDYKAMARDEKNYYQD 
TT^KQIRSKINVYKRFYPAEWQDFLDSLQKRK 
MEVE 


932 


2282 


A 


son 


412 


I 


"SNLCLGNSWRWRWAKSRHHCIPTVTLSKRSG 

DIRGSHFSSPQRQRSQRVPGKETARVLRAGK 

QGRGQlPIPCPWPPPPPPPPPGSPGPGCa^QFHQ 

SLEAKARHPASVR£MRGKV(a«i4RRALRRAPA 

STRASSRQPNPK 


933 


2283 


A 


8012 


147 


1077 


"PPVPPASRSDMAQNLKDLAGRLPAGPRGMar 
ALKLLLGAGAVAYGVRESVFTVEX3GHRAIFF 
NRIGGVQQDTlLAEGLHFRIPWFQYPnYDIRA 
RPRKISSPTGSKDLQMVNISLRVLSRPNAQEL 
PSMYQRLGLDYEERVLPSIVNEVLKSWAKF 
NASQLrrQRAQVSLURRELTERAKDFSULDD 
VA1TP1 ^P^RFYTAAVEAKOVAOOEAORAOF 
LVEKAKQEQRQKIVQAEGEAEAAKMLGEAL 
SKNPGYKLRKIRAAQNISKTIATSQNRIYLTA 
DNLVLNLODESFTRGSDSUKGKK 


934. 


2284 


A 


8023 


255. 


982 


SQFSLSQVLVDSAEEGSLAAAAELAAQKRbg 

RLRKiTOLHLMRNEARKLNHQEVVEEDKRL 

KLPANWEAKKARLEWELKEEEKKKECAARG 

EDYEKVKJULEISAEDAERWEIUCKKRKNPDLG 

FSDYAAAQLRQYHRLTKQIKPDMETYERLRE 

KHGEEFFPTSNSLUIGTHVPSTEEIDRMVIDLE 

KQIEKRDKYSRRRPYNDDADIDYINERNAKF 

NKKAERFYGKYTAEKQNLEROTAV 


935 


2285 


A 


8027 


59 


310 


LVSSTVNLLTEKAPWNSLAWTVTSYVFLKl'L 
QGGGTGSTGMRDSALTLLGIGPSHRHSLSIRL 
SOHSSPAPMYSQTFHILVLG 




2286 


A 


8032 


1 


639 
311 


SGRECNMAKTYDYLFKLLLIGDSGVGKTCVL 

FRFSEDAFNSTFISnOIDFKIR'nELDGKRIKLQ 

IWDTAGQERFRXnTAYYRGAMGIMLVYprr 

NEKSFDNIKNWIRNIEEHASADVEKMILGNKC 

DVNDKRQVSKmGEKl^DYGIKFMETSAK 

ANINVENAFFILARDKAKMDKKLEGNSPQG 

SNQGVKITPDQQKRSSFFRCVLL 

EETIHSENSYILEKYIPISANLTLTIA 


937 
938 


2287 
2288 


A 
A 


8039 
8052 


393 
675 


•1334 


" ■ LHPAATSTAWLHVPPGLSMALSWVLTVLbLL 
PLLEAQIPLCANLVPVPrrNATLDRITGKWFYI 
ASAFRNEEYl^VQEIQA'nFFYFTPNKTEDTIF 
LREYCymQDQCIYNTTYLNVQRENGTISRYV 
GGQEHFAHLLILRDIKTYMLAJFDVNDEKNW 
GLSVYADKPETTKEQLGEFYEALDCLRIPBCSD 
WYTDWKKDKCEPLEKQHEKERKQEEGES 


939 


2289 


A 


8055 


12 




' ssvaefpervqlsqpq!nwnfsgaggaw«lw 
aeqlkwsaelarlgesimdgkqggmdgskp 
aoprdfpoirllsnplmodavsdwspmheaa 

fflGHQLSLKNLISQGWAvI*UJTAUnViri-HE.ft 

CLGGHLSCn^KILLKHGAQVNGVTADWHrPL 

FNACVSGSWDCVNLLLQHGASVQPESDLASP 

IHEAARRGHVECVNSLIAYGGNIDHKISHLGT 

PLYLACENQQRACVKKLLESOADVNQGKGQ 

DSPLHAVARTASEELAaLMDFOADTQAKN 

AEGKRPVELVPPESPLAQLFLEREGPPSLMQL 

CRUURKCFGlQQHHKmCLVlPBDLKQFLLH 


1 940 


2290 


A 


8058 


2 


1203 


■ KVLSIREPAHSTARKASBPSQPSQP5QPGUm.i 
ARLRTMDLHLFDYSEPGNFSDISWPCNSSOa 
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SEQID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

coirespondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteme, 
I>Aspartic Add, EF<jlutamic Acid, 
F»Phaiylalaninc> G=01ycine, H^Hisddine, 
Hsolcucine. K=4-ysinc, L=Lcucinc, 
M=Methionme, N«Asparaginc INProline. 
Q=Glutaminc, R=Arginine, S^Scrinc, 
T=Threonine, V=Valinc, W=<riyptophan, 
Y=Tyroane, X^Uriknown, ♦"^top codon, 
/"pos^ble nucleotide deletion, \ppossible 
nucleotide insertion 














VVDTVMCPNMPNKSVU-YTL^Fr^IFIFVlGMI 

ANSVYVWVNIQAKTTGYOTHCVTLNLAI/^ 

WVVLTIPVWVVSLVQHNQWPMGELTCKVTH 

LireiNLFGSIFRTCMSVDRYLSrrYFrNTPSS 

RKK^mRWCILVWLLAFCVSLPDTYYLKT 

VTSASNNETYCRSFYPEHSnCEWUGMELVSV 

VLGFAVPFSHAVFYFLLARAISASSDQEKHSS 

RKIIFSYVVVFLVCWLPYHVAVLLDIFSILHYI 

PFTCRlJEHALFTAlifVTQCLSLVHCCVNPVL 

YSFINW^YEUiKAFIFKYSAKTGLTKLIDA 

SRVSETEYSALEQSTK 


941 


2291 


A 


8059 


73 


432 


"DNUGLMTIVTSLLFLGVCAHHIIPTGSVVIJPS 
PCCMFFVSKRIPENRWSYQLSSRSTCLKAGV 
IFTTKKGQQFCGDPKQEWVQRYMKNLDAKQ 
If If A CPP A R A V A VKGPVORYPGNOTTC 


942 


2292 


A 


8067 


278 


1262 


GOIGEIKQRPSCLORCLDPSLSVLMNISLXjLGS 

WSAVISQKPSRDICQRGTSLTIQCQVDSQVT 

MMFWYRQQPGQSLTLIATANQGSEATYESGF 

VIDKFPISRPNLTFSTLTVSNMSPEDSSIYLCSA 

GRQGTYEQYFGPGTRLTVTEDLKNVFPPEVA 

VFEPSEAEISHTQKATLVCLATOFYPDHVELS 

WWVNGKEVHSOVSTDPQPLKEQPALNDSRY 

CLSSRLRVSATFWQNPRNHFRCQVQFYGLSE 

NDEWTQDRAKPVTQIVSAEAWGRADCGFTS 

CQvnnnvf 9ATTLYE1LLGKATLYAVLVSALV 

LMAMVKRKDSRG 


943 


2293 


A 


8070 


1 


879 


■ MVKVVPATROT^LPRSQLTGTHQHCQPREFKl 
TASERLRRRPRATARLRAHAAPPEPPLAVFAP 
PSDRKELLALPVACDPVIASVMSWVQAASU 
QGPGDKGDVFDEEADESIXAQREWQSNMQR 
RVKEGYRDGIDAGKAVTLQQGFNQGYKKGA 
EVDLNYGRLRGTLSALLSWCHLHNNNSTLINK 
INNLLDAVGQCEEYVLKHLKSITPPSHWDLL 
nsTFDMDLCHVVPAEKKIDEAKDERLCENNA 
EFNKNCSKSHSGIDCSYVECCRTQEHAHSGK 
PKPHMDFGTDSQF 


944 


2294 


A 


8073 


1 


797 


ESAkWSRQLkRTLIRLSFPlSCGRSHAFGGCK 

MAATSGTDEPVSGELVSVAHALSLPAESYGN 

DPDIEMAWAMRAMQHAEVYYKLISSVDPQF 

LKLTKVDDQIYSEFRKNFETLRIDVLDPEELK 

SESAKEKWRPFOJO^GIVEDFNYGTLLRLD 

CSQGYTEENTIFAPRIQFFAIEIARNREGYNKA 

VYISVQDKEGEKGVNNGGEKRADSGEEENT 

KNGGEKGADSGEEKEEGINREDKTDKGGEK 

GKEADKEINKSGEKAM 


945 


2295 


A 


8074 


z 


SOS 


" "GAATLLRSASSAARKAAEAEQVWLHLHRYL 
SADRRVLGLREWGRPASERECSLCQRLKREL 
NMGDVEKGKKIFIMKCSQCinVEKGGKHKT 
GPNLHGLFGRK 1 uyAru i d i i aathaj^ivvjix vf 
QEim.MEYLENPKKYIPOTKMIFVGIKKKEER 
ADLIAYLKKATNE 


946 


2296 


A 


8081 


42 


590 


' ■ EGRRGKFGGKLCNFLFYFHSNSAESRMDVLf 
VAIFAVPLILGQEYEDEERLGEDEYYQWYY 
YTVTPSYDDFSADFTIDYSIFESEDRLNRLDK 
DITEAIETTISLETARADHPKPVTVKFVTTEPQ 
SPRSEAMPCPVLRSPIPLPPVRVPLFRWGCISC 
KKVGRRLLMTLWMGVWQEEIGR 


947 


2297 


A 


8084 


322 


549 


■ ■ GGGSSPRELAGAAGLrV'l'SQAVAARRQQFii*' 
SRARAPAHSLRAALSLASSARSWGAVSRDRG 
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SEQID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
locatioQ 
coiresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A^Alanine C=Cysteine, 
D=Aspartic Acid, &=Glutamic Acid, 
F=Phenylalaninc, G=01ycine, H=Histidine, 
I=Isoleucine, K«Lysine, LpLeucinc, 
M=Mcthionine,N=Asparagmc, P=Proline, 
Q=Glutamine,R-Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tiyptophan, 
Y=Tyrosine, X=Unknown, »=Stop codon, 
/^possible nucleotide deletion, V=possible 
nucleotide insertion 
PC3*PAIMYQSSNKC 


948 


2298 

•4; 


B 


8093 


3905 


846 


MEPGEVKDRILENISLSVKKLQSYFAACEUiil 

PAIRNHDKVLQRLCEHLDHAIXYGLQDLSSG 

YWVLVVHFTOREAIKQIEVLQHVATNLGRSR 

AWLYLAjLNENSLESYLRLFQENLGLLHKYYV 

KNALVCSHDHLTLFLTLVSGLEFIRFELDLDA 

PYLDLAPYMPDYYKPQYLLDFEDRLPSSVHG 

SDSLSLNSFNSVTSTNLEWDDSAIAPSSEDYD 

FGDVFPAVPSVPSTDWEDGDLTDTVSGPRST 

ASDLTSSKASTaSPTQRQNPFNEEPAETVSSS 

imPVHTTSQEKEEAQALDPPDACTELEVIRV 

TKJKKHGKKKKSRSDEEASPLHPACSQKKCA 

KQGDGDSRNGSPSLGRDSPDTMLASPQEEGE 

GPSSTTESSERSEPGLLIPEMKDTSMERLGQPL 

SKVIDQLNGQLDPSTWCSRAEPPDQSFRTGSP 

GDAPERPPLCDFSEGIWMDFYRFTVESPST 

VTSGGOHHDPAGLGQPLHVPSSPEAAGQEEE 

GGGGEGQTPRPLEDTTREAQELEAQLSLVRE 

GPVSEPEPGTQEVLCQLKRDQPSPCLSSAEDS 

GVDEGQGSPSEMVHSSEFRVDNNHLLLLMIH 

VFRENEEQLFKMIRMSTGHMEGNLQLLYVLL 

TDCYVYLLRKGATEKPYLVEEAVSYNELDY 

VSVGLDQQTVKLVCTNRRKQFLLDTADVAL 

AEFFLASLKSAMIKGCREPPYPSILTDATMEK 

LAIAKFVAQESKCEASAVTVRFYGLVHWED 

PTDESLGPTPCHCSPPEGTITKEGMLHYKAGT 

SYLGKEHWKTCFWLSNGILYQYPDRTDVIP 

LLSVNMGGEQCGGCRRANTTDRPHAFQVILS 

DPPCLELSAESEAEMAEWMQHLCQAVSKGVI 

PQGVAPSPCEPCCLVLTDDRLFTCHEDCQTSF 

FRSLGTAKLGDISAVSTEPGKEYCVLEFSQDS 

QQLLPPWVIYLSCTSELDRULSALNSGWKnY 

QVDLPHTAIQEASNKKKFEDALSUHSAWQR 

SDSLCRGRASRDPWC* 


949 


2299 


A 


8095 


9 


2374 


ARRADTVLLESPSMLQOLLPVSLLLSVAVSAl 

KELPGVKKYEWYPIRLHPLHKREAKEPEQQ 

EQFETELKYKMTINGKUVLYLKKNKNLLAP 

GYTETYYNSTGKEITTSPQIMDDCYYQGHILN 

EKVSDASISTCRGLRGYFSQGDQRYFIEPLSPI 

HRDGQEHALFKYNPDEKNYDSTCGMDGVL 

WAHDLQQNIALPATKLVKLKDRKVQEHEKY 

lEYYLYLDNGEFKRYNENQDEIRKRVFEMAN 

YXO^YKKLNIWALVGMEIWTDKDKIKIT 

PNASFTLENFSKWRGSVLSRRKRHDUQUTA 

TELAGTTVGLAFMSTMCSPYSVGWQDHSD 

nllrvagtmahemghnfgmfhddysckcps 
ticvmdkalsfyiptdfsscsrlsydkffedkl 
snclfnaplptdnstplcgnqlvemgedcdc 
gtseectniccdaktckjokatfqcalgeccek: 

CQFKKAGMVCRt AJ^bt^i-'JL- "iiMV^iN urvQ%^^ 

pddrfqvngfpchhgkghclmgtcptlqeq 

ctelwgpgtevadkscynrnegoskygycr 

rvddtlipckandtmcgklfcqggsdnlpw 

kgrivtfltcktfdpedtsqeigmvangtbccg 

dnkvcinaecvdjekaykstncsskckghav 

cdhelqcqceegwippdcddssvvfhfsiwg 

vlfpmavifvwamvirhqssrekqkkdqrp 

lsttgtrphkqkrkpqmvkavqpqemsqmk 

phvydijvegneppasfhkdtnalpptvfkd 

npmstpkdsnwca 
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SEQID 5 
NO: of 1 
nucl- 1 
eotide i 
seq- t 
uenoe 


5BQID I 
NJO: of I 
septide 
scq- 
iieiice 


^et J 
lod 1 
1 

1 


5EQ 1 1 
DNO: \ 
in 1 
USSN 
[)9/496 
914 


Predicted H 
seginning i 
lucleotide 
location 
conespondi 
ng to first 
amino acid 
residue of 
peptide 
sequence 


'redicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Aianme OCysteme, 
>Aspartic Acid, E^Glutamic Acid, 
F=Phenylalaninc, G^Giycine, H=Histidine, 
Nlsoleucinc, K«Lysine, L^Leucine, 
M=MeAionine, N=Aspaiagine, P=Proline, 
QKjlutamine. R=Arginine, S=Serine, 
T==Threonine, V=Valine, W=Ttyptophan, 
Yj^Tyrostne, X=Unknown, ♦=Stop codon, 
/^possible nucleotide deletion, \=possible 
nucleotide insertion 


950 


2300 


A 


8100 


1 


1251 


MGLLLMELASAVLGSFLTLLAQFFLLYRRQPii 

PPADEAARAGEGFRYDCPVPGLLLREYLYGG 

ORDEEPSGAAPEGGATPTAAPETPAPPTRETC 

mNATIli'i™EUa)TALTIUlWVTKiaKVEF 

EELLQTKTAGRLLEGLSLRDVFLGETVPFIKTI 

KLVRPWPSATGEPDGPEOEALPAACTEELAF 

EAEVEYNGGFHLAIDVDLVFGKSAYLFVKLS 

RWGRLRLVFTRVPFTHWFFSFVEDPLIDFEV 

RSQFEGRPMPQLTSnVNQUCKnKRKHTLPNY 

mFKPFFPYQTLQGFEEDEBHIHIQQWALTE 

GRLKVXaECSRLLIFGSYDREANVHCTLELS 

SSVWEBKQRSSKTG'nSLTAVFMGWHRVSE 

AFPGLWYKXLVDLPFWGLEDGGPLLTVPLRQ 

CPG 


951 


2301 


A 


810S 


1612 


839 


EVALFCFEMAAGMYLEHYLDSIENLPFELQR 

NFQLMRDU)QRTEDLKAEIDKLATEYMSSAR 

SLSSEEKLALLKQIQEAYGKCKEFGDDKVQL 

AMQTYEMVDKHIRRLDTDLARFEADLKEKQI 

ESSDYDSSSSKGKKKGRTQKEKKAARARSKG 

KNSDEEAPKTAQKKLKLVRTSPEYGMPSVTF 

GSVHPSDVLDMPVDPNEPTYCLCHQVSYGE 

MIGCDNPDCSEEWFHFACVGLTTKPRGKWFC 

PRCSOERKKK 


952 


2302 


A 


8112 


595 




PSVASLARRFSGRALWPPSHSVPGNRALCFKl- 
LHGTTLPGGNQRBLARQKNMKKQSDSVKGK 
RRDDGLSAAARKQRDSTPRPSEIMQQKQKK 

ANEKKEEPK 


953 


2303 


A 


8118 




669 


VCAGIRDPCSTPLAKPAAGGAENLSFGKQPG 

LETNILKMTTPNKTPPGADPKQLERTGTVREI 

GSQAVWSLSSCKPGFGVDQLRDDNLETYWQ 

SDGSQPHLVNIQFRRKTTVKTLCIYADYKSDE 

SYTPSKISVRVGNNFHNLQEIRQLELVEPSGW 

IHVPLTDNHKJOTUTFMIQIAVLANHQNGRD 

THMRQKmPVEESSIGKFPRCrrroFMMYRS 

TR 


954 


2304 


A 


8133 


66 


1015 


PPLPPRSFPNLFSRPEPLPHPGRRGCNKSRKI'A 

ARAPSPPPPFEGAPGRAMVKVTFNSALAQKE 

AKiCDEPKSGEEALnPPDAVAVDCKDPDDW 

PVGQRRAWCWCMCFGLAFMLAGVILGGAY 

LYKYFALQPDDVYYCGIKYIKDDVILNEPSAD 

APAALYQTIEENIKIFEEEEVEnSVPVPEFADS 

DPANIVHDFNKKLTAYLDLNLDKCYVIPLNT 

SIVMPPRNLLELLINIKAGTYLPQSYLIHEHMV 

ITDRIENIDHLGFFIYRLCHDKETYKLQRRKn 

KGIOKREASNCFAIRHFENKFAVETLICS 


955 


2305 


A 


8143 


1 jj 

\ ✓ 


1171 


■ ^V^RSAWHEGEDQlDKLDFIRNQMNLLri.uv 
KKKIKEVTEEVANKVSCAMTDEICRLSVLVD 
EFCSEFHPNPDVIJKIYKSEU^KHIEDGMGRNL 
ADRCTDEVNALVLQTQQEIIENLKPLLPAGIQ 
DKLHTLIPCKKFDLSYNLNYHKLCSDFQEDIV 
FRFSLGWSSLVHRFLGPRNAQRVLLGLSEPIF 
QLPRSIASTPTAPnPATPDNASQEELMITLVT 
GLASVTSRTSMGfflVGGVIWKTIGWKLLSVa 
LTMYGALYLYERLSWTTHAKERAFKQQFVN 
YATEKLRMIVSSTSANCSHQVKQQIATTFARL 
CQQVDITQKQLEEEIARLPKEIDQLEKIQNNS 
KLLRNKAVOLENELENFTKOFLPSSNEES 


956 


2306 


A 


,8157 


1854 


""198 


- ASGSPAPSSSSAMAAACGPUAAUYCLLLGLH 
LFLLTAGPALGWNDPDRMLLRDVKALTLHY 
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957 



2307 



Predicted 

beginning 

nucleotide 

location 

correspondi 

ngtofiist 

amino acid 

residue of 

peptide 

sequence 



8159 



1492 



958 



2308 



8161 



959 



I 2309 



960 



rsTo 



Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 



528 



2340 



1192 



521 



2921 



Amino acid sequence (A=Aianine OCystemeT 
D=«Aspartic Add. EFGhitamic Acid, 
F«Phenylalanine, G«Glycinc, H«ffistidine, 
Msoleucinc, M-ysinc, LpLcucine, 
M=Mcthioninc, N=Asparagine» P»Piolin^ 
Q^lutamine, R«Arginine, S=Scrine, 
T^Threonine, V-Valine, W^Tiyptophan, 
Y^Tyrosine, X=Unknowii, ♦=St»p codon, 



nucleotide insertion 
DRYTTSRRLDPIPQLK' 




NTGHGATSGFGSAFTOQ^^Uifiiwaurww iw*- 
GTGGILGYLFGSNRAATPFSDSWYYPSYPPSY 
PGTWNRAYSPLHGGSGSYSVCSNSDTKTRTA 

SGYG GTRRR — —i 

THVVMTGMCYAPHQVLSYINGVTrSKPGVSL 



DKQGKSRGHSlKTLiiLN VLvrr/vrro^^i^yvj v 

PHVGANVTLSCQSPRSKPAVQYQWPRQLPSF 

OTFFAPALDVIRGSLSLTNLSSSMAGVYVCKA 

HNEVGTAQCNVTLEVSTOPQAAWAGAWO 

TLVGLOLLAGLVLLYHRROKALEEPANDIKE 

DAIAPRTLPWPKSSDTISKNGTLSSVTSARAL 

RPPHGPPRPGMTFIPSlJSSQAIPSPWJrrTDG 

AHPQPISPIPGGVSSSGLSRMGAVPVMVPAQS 

gLARRPKQQSSEKSRNMlRNWLTlFILFPLKLV 

EKCESSVSLTVPPVVKLENGSSTNVSLTLRPP 

LNATLVrrFErWRSKNmLELPDEVVVPPGVT 




FLVIRSSAISHNQ YIO Wiy !• V AWOior i rv » 

NWRRKSVIGLSFDFVALNLTGFVAYSVFNIGL 

LWVPYIKBQFLLKYPNGVNPVNSNDVFFSLH 

AWLTLIIIVQCCLYEROGQRVSWPAIGFLVL 

AWIJAFVTMIVAAVGVrnVLQFLFCFSYIKL 

AVTLVKYFPQAYMNFYYKSTEGWSIGNnX 

DFTGGSFSLLQMFLQSYNNDQWTLIFGDPTK 

FGLGVFSIVFDVVFnQHFCLYRKRPGYPQLN 




MQPPGPPPAYAFTNUUf irvjsaAu/vEdJi^vju* 

Aa>DVKLNLGGDFIKESTATrFLRQRGYGWL 

LEVEDDDPEDNKPLLEELDIDLKDIYYKIRCV 

LMPMPSLGFNRQWRDNPDFWGPLAWLFFS 

MISLYGQFRVVSWnXIWIFGSLTIFLLARVLG 

GEVAYGQVLGVIOYSLLPLIVIAPVLLWGSF 

EWSTLIKLFGVFWAAYSAASLLVGEEFKTK 

KPLL IYPIFLLYIYFLSLYTGV ^ 

MTCFkGQKOEQRSHAFEANKDHKAKYPlSi'W 
LYSQLNALQFTVDERSILWLNQFLLDLKQSL 
NOFMAVYKLNDNSKSDEHVDVRVDGLMLK 
mi^cwwfiPrHOnOPRAISIOSSEMIATNTRH 




mNFNLUIPIFQRHAHEQDTKMHEIYKGNITP 

« vn^rt«r VPC A ATnVWAWPSOFWIDYEGM 



KSGKGRPISFVDSFPLSlwit;^?r lis. I 

OTCNQVSLNTSQSESSDLAGRLKRKKLLKEY 

YSTESEPLTOGGQKPSSSDTFFRFSPSSSEADI 

HLLVHVMCHVSMQINHYQYLLLLFLHESLILL 

SENLRKDVEAVTGSPASQTSICIGELLRSAELA 

, T 1 Trti^mrvAxm vsiPV??P<?VSPWPDYLPTEN 




LSDKHLGKISEDES5ULVifJ!.^vjaur.ivjocio*^ 
KKDSFYTDSSSVLNYREDSNILSFDSDGNQNI 
LSSTLTSKGNETIESIFKAEPLLPEAASLSE^a> 
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SEQ ID 
NO: of 1 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


ivlet 1 
[lod 

1 
j 


SEQ ] 
[DNO: 1 
in 

USSK 
09/496 
914 


Predicted 

[)eginning 

ttucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted eod 
[lucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Aianme u=ijysieme. 
D=Aspartic Acid, E^Glutamic Acid. 
F«PhcnylalBnine, G=Glycine, H=Histidinc» 
I»Isolcudnc, KpLysine, LHLcucinc, 
M=Methionine, N=Asparagine. P=Proline, 
C^lutaminc, R=Arginine, S=Serine, 
T=Threonine, Vs=Valine, W=»Tiyptophan, 
Y=Tyrosine, X=Unknown, ♦=Stop codon, 
A=possible nucleotide deletion, V=possible 
nucleotide insertion 














DISKEETPPVRTUCSQSSLSGKPKERCPPNLAP 

IXVSYKNK!KRSSSQMSU)TISU>SMa.EEQLL 

ESDGSDSHMnJEKGNKKNSTTimGTAESYN 

AGANLQNYGETSPDAISTNSBGAQENHDDLM 

SVWFKrrOVNGEIDIRGEDTEICLQVNQVTP 

DQLG>n[SUWYlXmPVGSDQKAVIHSKSSPE 

ISLRFESGPGAVIHSLLAEKNGFLQCHIENFST 

EFLTSSLMNIQHFI^ETVATVMPMKIQVSNT 

KINLKDDSPRSSTVSLEPAPNrrVHIDHLVVER 

SDDGSFHnU)SHMLmX3>©LKENVKSDSVLL 

TSGKYDLKKQRSVTQATQTSPGVPWPSQSAN 

FPEFSFDFTREQLMEENESLKQELAKAKMAL 

AEAHLEKDALLHHDCKMTVE 


"961 


2311 


A 


8172 


1442 


682 


TAAMSnvTPTNOIRLTNVAVVRMkkAUJUU'lsi 

ACYra^WGWRSGVEKDLDEVUJflHSVFVN 

VSKGQVAKKEDUSAFGTDDQTEICKQE.TKG 

EVQVSDKERHTQLEQMFRDIATIVADKCVNP 

KnOU'YTmiERANIKDIHYSVKTNKSTKQQA 

l^KQLKEKlsmERAHMRLRFn-PVNEGKKL 

KEBCLKPLKVIESEDYGQQLEIVCLIDPGCFREI 

ncT nf If PTKGKOSLEVLNLKDVEEGDEKFE 


962 


2312 


A 


8175 


286 


587 


>nSNKAEVSSHPSVISHSMDSFOQPRPEDNQS 

VLRRMQKKYWKTKQVFIKATGKKEDEHLVA 

SDAELDAKIXVFHSVQErOTLLKIIEKYQLR 

LNGMKS 


963 


2313 


A 


8181 


13 


2215 


AEGCAERRGTEPWELSMSWESGAGPGLUSiV^ 

GMDLVWSAWYOKCVKGKGSLPLSAHGIW 

AWLSRAEWDQVTVYLFCDDHBaQRYALNRI 

TVWRSRSGNELPLAVASTADLIRCKLLDVTG 

GLGTDELRLLYGMALVRFVNLISERK'TKFAK 

mKClJVQEVNIPDWlWLRHELTHKKMPHI 

NDCRRGCYFVLDWLQKTYWCRQLENSLRET 

WELEEFREGIEEEDQEEDKNIWDDITEQKPE 

PQDDGKSTESDVKADQDSKGSEEVDSHCKK 

ALSHKELYERARELLVSYEEEQFTVLEKFRYL 

FKAIKAWNNPSPRVECVLAELKGVTCENREA 

VLDAFLDDGFLVFITEQLAALQIEYEENVDL 

T4DVLVPKPFSQFWQPLLRGLHSQNFTQALLE 

RMLSELPALGISGIRPTYILRWTVELIVANTKT 

GRNARRFSAGQWEARRGWRLFNCSASLDWP 

RMVESCLGSPCWASPQLLRUFKAMGQGLPD 

EEQEKLLRICSIYTQSGENSLVQEGSEASPIGK 

SPYTLDSLYWSVKPASSSFGSEAKAQQQEEQ 

GSVNDVKEEEKEEKEVLPDQVEEEEENDDQE 

EEEEDEDDEDDEEEDRMEVOPFSTGQESPTA 

ENARLLAQKRGALQQSAWQVSSEDVRWDTF 

PLGRMPGQTEDPAELMLENYDTMYUXIQPV 

LEQRLEPSTCKTDTLGLSCGVGSGNCSNSSSS 

>JFEGLLWSOGOLHGLKTGLOLF 


964 


2314 


A 


8184 


6 


1393 


" ■ rrnjTsxrcD r^nvi'i 'w Pk' 1 kGRTRGRRRRACRSAE 
OTOLRSLLLPPRLQLPAGPFSRCRWDPVSSPR 
PSTMPPKKGGDGDCPPPnGRFGTSLKIGIVGLP 
NVGKSTFFNVLTNSQASAENFPFCnDPNESR 
VPVPDERFDFLCQYHKPASKJDPAFLNWDIAG 
LVKGAHNGQGLGNAFLSmSACDGIFHLTRA 
FEDDDITHVEGSVDPIRDIEIIHEELQLKDEEMI 
GPUDKLEKVAVRGGDKKLKPEYDIMCKVKS 
WVTOQKKPVRFYHDWNDKEIEVLNKHLFLTS 
KPMVYLVNLSETOYIRKKNKWLIKIKEWVD 
KYDPGALVIPFSGALELKLOELSAEERQKYLE 
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SEQIU 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQID 
NO: of 
peptide 
seq- 
uence 


Met 
tiod 


SEQ 

©NO: 

in 

USSN 
09/496 
914 


Predicted 
beginning 

niirleAtide 

location 
correspondi 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted endT" 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alamne OCysteine, 
D-Aspartic Acid, EKSlutamic Acid, 
F«=Phcnylalaninc, G=Qlycinc, H»Histidine, 
]=Isoleucine> K«Lysine, L=Lcucine, 
M=4vlethionine, N=Asparagine. P=Proline. 
QMjlutamine. R=Arginine, S=Serine, 
T^Tireonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, \«possible 
nucleotide insertion 














ANMTQSALPKinCAGFAALQLEYFUAUi'utiv 
RA\mRK(nKAPQAAGKlHTDFEKGFIMAEV 
MKYEDFKEEGSENAVKAAGKYRQQGRNYIV 
EDGDIDFFKFNTPQQPKKK 


965 


2315 


A 


8195 


1437 


"594 


RSFSLSFSIXSPSEMMALGAAGATRWVAMV 

aaalgghpllgvsati;nsvlksnaiknlpppl 

GGAAGHPGSAVSAAPGILYPGGNKYQTIDNY 

OPyPCAEDEECGTDEYCASFTRGGDAGVQIC 

LACRKKMOICMRHAMCCPGNYCKNGICVSS 

DQNHFRGEDSEnTESFGNDHSTLDOYSRRTT 

LSSKMYHTKGQEGSVCXRSSDCASGLCCARH 

FWSKICKPV1JKEGQVCTKHRRK,GSHGLEIFQ 

RCYCGEGLSCRIOKDHHQASNSSKLHTCQRH 


966 


2316 


A 


8207 


416 


4082 


KFKLIKIMLL'ILIILLPVVSKi''SFVSLSAPQHW 

SCPEOTLAGNGNSTCVOPAPFLIFSHGNSIFRI 

DTEGTNYEQLWDAGVSVIMDFHYNBKRIY 

WVDLERQLLQRVFLNGSRQERVCNIEKNVSG 

MAINWINEEVIWSNQQEGIITVTDMKGNNSHI 

LLSALKYPAKVAVDPVERFIFWSSEVAGSLY 

RADLDGVGVKALLETSEKITAVSLDVLDKRL 

FWIQYNREGSNSLICSCDYDGOSVHISKHPTQ 

HNLFAMSLFGDRIFYSTWKMKTIWIANiCHTG 

KDMVRINLHSSFVPLGELKWHPLAQPKAED 

DTWEPEQKLCKLRKGNCSSTVCGQDLQSHLC 

MCAEGYALSRDRKYCEGNDWKYCEDVNEC 

AFVmiGCTLGCKNTPGSYYCTCPVGFVLLPD 

GKRCHQLVSCPRNVSECSHDCVLTSEGPLCF 

CPEGSVLERDOKTCSGCSSPDNGGCSQLCVPL 

SPVSWBCDCFPGYDLQLDEKSCAASGPQPFL 

iJANSQDIRHMHFDGTDYGTLLSQQMGMVy 

ALDHDPVENKIYFAHTALKWIERANMDGSQ 

RERLDEEGVDVPEGLAVDWIGRRFYWTDRGK 

SLIGRSDLNGKRSKinriENISQPRGIAVHPMAK 

RLFWTDTGINPRIESSSLQGLGRLVIASSDUW 

PSGrriDFLTDKLYWCDAKQSVIEMANLDGSK 

RRRLTQNDVGHPFAVAVFEDYVWFSDWAMP 

SVmVNKRTGKDRVRLQGSMLKPSSLVWHP 

LAKPGADPCLYQNGGCEHICKKRLGTAWCS 

CREGFMKASDGKTCLALDGHQLLAGGEVDL 

KNQVTPLDILSKTRVSEDNITESQHMLVAEIM 

VSDQDDCAPVGCSMYARaSEGEDATCQCLK 

GFAGDGKLCSDIDECEMGVPVC3'PASSKCmT 

EGGYVCRCSEGYQGDGIHCLDIDECQLGVHS 

CGENASCTNTEGGYTCMCAGRLSEPGLICPD 

STPPPHLREDDHHYSVRNSDSECPLSHDGYCL 

HDGVCMYIEALDKYACNCWGYIGERCQYR 

DLKWWELRHAGHGQQQKVIWAVCVVVLV 

NOXLLSLWGAHYYRTQKLLSKNPKNPYEESS 

RDVRSRRPADTEDGMSSCPQPWFWIKEHQD 

t yj"^xr>ryi\D\rAnxrrsnr\A AnnSMOPTSWROEPO 

LCGMGTEQGCWIPVSSDKGSCPQVMERSFH 

MPSYGTQTLEOOVEKPHSLLSANPLWQQRAL 

DPPHOMELTQ 


967 


' 2317 


A 


8210 


3 


"601 


■ SSAMGSRSSHAAVIPDGDSIRRETGFSQASLL 
RLHHRFRALDRNKKGYLSRMDLQQIGALAV 
NPLGDRIIESFFPDGSQRVDFPGFVRVLAHFRP 
VEDEDTETQDPKKPEPLNSRRNKLHYAFQLY 
DLDRDGKISRHEMLQVLRLMVGVQVTEEQL 
ENIADRTVQEADEDGDGAVSFVEFTKSLEKM 
DVEHKMSmiLK 
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SEQDD 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ED NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
add residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine OCysteine, 
I>Aspartic Acid, E=Glutamic Acid, 
F^Phcnylalanine, G«Glycine, H=Histidine, 
I^Isoleucine, K^Lysine, L*=lcucine, 
M=Methionine, N=Aspaiagine, P^Proline, 
Q=Glutamine, R=Argininc, S=6erine, 
T=Threonine, V=Valine, W-Ttyptophan, 
YsTyrosine, X=Unknown, *=Stop codon, 
A=possible nucleotide deletion, V=possibIe 
nucleotide insertion 


968 


2318 


A 


8211 


2 


409 


ISSCPHTAYEGSMSTLSNFTQTLEDVFRRIFIT 

YMDNWRQNTTAEQEALQAKVDAENFYYVIL 

YLMWGN^SFIIVAILVSTVKSKRREHSNDP 

YHQYIVEDWQEKYKSQILNLEESKA'nHENIG 

AAGFKMSP 


969 


2319 


A 


8215 


1 


1938 


GMPRSRGGRAAPGPPPPPPPPGQAPRWSRWR 

VPGRLLLLLLPALCCLPGAARAAAAAAGAGN 

RAAVAVAVARADEAEAPFAGQNWLKSYGY 

LLPYDSRASALHSAKALQSAVSTMQQFYGIP 

VTGVLDQTTEWMKKPRCGVPDHPHLSRRRR 

NKRYALTGQKWRQKHITYSIHNYTPKVGELD 

TRKAIRQAFDVWQKVTPLTFEEVPYHEIKSDR 

KEADIMIFFASGFHGDSSPFDGEGGFLAHAYF 

PGPGIGGDTTHFDSDEPWTLGNANHDGNDLFL 

VAVHELGHALGLEHSSDPSAIMAPFYQYMET 

HNFKLPQDDLQGIQKIYGPPAEPLEPTRPLPTL 

PVRRIHSPSERKHERQPRPPRPPLGDRPSTPGT 

KPNICDGNFNTVALFRGEMFVFKDRWFWRJL 

RNNRVQEGYPMQIEQFWKGLPARIDAAYER 

ADGRFVFFKGDKYWVFKEVTVEPGYPHSLG 

ELGSCLPREGIDTALRWEPVGKTYFFKGERY 

WRYSEERRATDPGYPKPTTVWKGIPQAPQGA 

FISKEGYYTYFYKGRDYWKFDNQKLSVEPGY 

PRNILRDWMGCNQKEVERRKERRLPQDDVDI 

MVTINDVPGSVNAVAVVIPCILSLCILVLVYTl 

FQFKNKTGPQPVTYYKRPVQEWV 


970 


2320 


A 


8216 


1235 


2223 


SRLSLQFYVSFRRTGLFTCKLIVEIFFRNYMN 

DSLRTNVFVRFQPETIACACIYLAARALQIPLP 

TRPHWFLLFGTTEEEIQEICIETLRLYTRKKPN 

YELLEKEVEKRKVALQEAKLKAKGLNPDGTP 

ALSTLGGFSPASKPSSPREVKAEEKSPISINVK 

TVKKEPEDRQQASKSPYNGVRKDSKRSRNSR 

SASRSRSRTRSRSRSHTPRRHYNNRRSRSGTY 

SSRSRSRSRSHSESPRRHHNHGSPHLKAKHTR 

DDLKSSNRHGHKRKKSRSRSQSKSRDHSDAA 

KKHRHERGHHRDRRERSRSFERSHKSKHHGG 

SRSGHGRHRR 


971 


2321 


A 


8217 


3 


3274 


DCRLQAAMPTNFT WPVEAHADGGGDE 1 Afc 

RTEAPGTTEGPEPERPSPGDGNPRENSPFLNN 

VEVEQESFFEGKNMALFEEEMDSNPMVSSLL 

NKLANYTNLSQGWEHEEDEESRRREAKAPR 

MGTFIGVYLPCLQNULGVILFLRLTWIVGVAG 

VLESFLrVAMCCTCTKILTAISMSAIATNGVVP 

AGGSYYMISRSLGPEFGGAVGLCFYLGTTFA 

GAMYILGTDEIFLTYISPGAAIFQAEAAGGEAA 

AMLHNMRVYGTCTLVLMALWFVGVKYVN 

KLALVFLACWLSILATV^AGVIKSAFDPPDIPV. 

CLU3NRTLSRRSFDACVKAYGIHNNSATSAL 

WGLFCNGSQPSAACDEYFIQNNVTEIQGIPGA 

ASGVFLENLWSTYAHAGAFVEKKGVPSVPV 

AEESRASTLPYVLTDIAASFaLVGIYFPSVTG 

IMAGSNRSGDUCDAQKSIPTGTIUUVTTSFIY 

LSCrVLFGACIEGVVLRDKFGEALQGNLVIGM 

LAWPSPWVIVIGSFFSTCGAGLQTLTGAPRIX 

QAIARDGIVPFLQVFGHGKANGEPTWALLLT 

VLICETGILIASLDSVAPILSMFFLMCYLFVNL 

ACAVQTLLRTPNWRPRFKrraWTLSFLGMSL 

CLALMFICSWYYALSAMLIAGCIYKYIEYRG 

AEKEWGDGIRGLSLNAARYALLRVEHGPPHT 

K3WRPQVLVNILNLDAE0AMKHPRLLSFTSQ 
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1 SEQID : 
NO: of 1 
nucl- 
eotide 
seq- 
uence 


SEQID I 
1^0: of 1 
peptide 
seq- 
uence 


Viet I 
106 ] 

] 


5EQ ] 
[DNO: 1 
in 1 
USSN 
09/496 
914 


Predicted 

)eginning 

nucleotide 

location 

conrespondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 1 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence {A=Alanine OCystein^ 
D=Aspartic Acid, B=<jlutamic Acid, 
F'Phenylalaninc, G-<jlycine, H«Histid*me, 
I=Isolcucinc KpLysine, LF^-eucine, 
M=Methionine, N«A5iwiagine, P^Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
T«Threonine, VoValine. W^Tiyptophan, 
y=Tyrosine» X-Unknown, ♦^Slop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide iosertion 














LKAGKGLTIVGSVLEGTYLDKHMEAQRAKJi 

NIRSLMS-reKTKGFCQLWSSSLRDGMSHLIQ 

SAGLGGLKHNTVIJVIAWASWKQEDNPFSW 

KNFVDTVTIDTTAAHQALLVAKNVDSFPQNQ 

ERFGGGHIDVWWIVHDGGMLMLLPFLLRQH 

TCVWWCCmiRIFTVAOVDDNSIQMKKDLQMF 

LYHLRISAEVEVVEMVEhnilSAmERTLMM 

EQRSQMLKQMQLSKNEQEREAQLIHDRNTAS 

HTAAAARTQAPPTPDKVQMTWTOEKLIAEK 

YRSRDTSLSGFKDLFSMKPDQSNVBRMHTAV 

KLNGWLNKSQDAQLVLLNMPOPPKNRQGD 

KNYN4EFLEVLTEGLNRVLLVRGGGREVrnYS 


972 


2322 


A 


8224 


701 


246 


T'SRRVtMKFNPFV'l^iURSKm^ 
RRKIMSSPLSKELRQKYNVRSMWRKDDEVQ 
WRGHYKGQQIGKWQVYRKKYVIYIBRVQ 
REKANGTTVHVGIHPSKVVrrRLKLDKDRKKI 
LERKAKSROVOKEKGKYKEELIEKMQE 


973 


2323 


A 


8237 


873 


4610 


GCPHAGGKGRVPTGGLTGGRl'WSPSAAl'KSU 

PRPGPTPAPGAMDKLPPSMRKRLYSLPQQVG 

AKAWIMDEEEDAEEEGAGGRQDPSRRSIRLR 

PLPSPSPSAAAGGTESRSSALGAADSEOPARG 

AGKSSTNGDCRRFRGSLASLGSRGGOSGGTO 

SGSSHGHLHDSAEERRLIAEGDASPGEDRTPP 

GLAAEPERPGASAQPAASPPPPQQPPQPASAS 

CEQPSVDTAIKVEGGAAAGDQILPEAEVRLG 

QAGFMQRQFGAMLQPGVNKFSLRMFGSQKA 

VEREQERVKSAGFWnHPYSDFRFYWDLTML 

LLMVG^«-IIIPVGr^TKDE^mPWIVFNVVSD 

TFFLroLVLNFRTGrVVEDNTEIILDPQRIKMK 

YLKSWFMVDFISSIPVDYIFLIVETRIDSEVYK 

TARALRIVRFTKILSLLRLLRLSRLIRYIHQWE 

EIFHNn^LASAVVRIVNLIGMMLLLCHWDG 

CLQFLWMLQDFPDDO^SINNMVNNSWGK 

QYSYALFKAMSHMLCIGYGRQAPVGMSDV 

WLTMLSMIVGATCYAMFIGHATALIQSLDSS 

RRQYQEKYKQVEQYMSFHKLPPDTRQRIHD 

YYEHRYQOKMFDEESILGELSEPLREEIINFNC 

RKLVASMPIJANADPW^MLTKLRFEVFQ 

PGDYIIREGTIGKKMYnQHGWSVLTKGNKE 

TKLADGSYFGEiaLTRGRRTASVRADTYCR 

LYSLSVDNFNEVLEBYPMMRRAFETVALDRL 

DRIGKKNSILLHKVQHDLNSGVFNYQENEUQ 

QIVQHDREMAHCAHRVQAAASATPTPTPVIW 

TPLIQAPLQAAAATTSVAIALTHHPRLPAAIFR 

PPPGSGLGNLGAGQTPRHLKRLQSLIPSALGS 

ASPASSPSQVDTPSSSSFHIQQLAGFSAPAGLS 

PLLPSSSSSPPPQACGSPSAPTPSAGVAATTIA 

GFGHFHKALGGSLSSSDSPLLTPLQPGARSPQ 

AAQPSPAPPGARGGLGLPEHFLPPPPSSRSPSS 

^/-^TiiM^Tji OT 1^1 AT/lPI CTPETPP'ROPEP 
SPGQLGQPPGBLSLG*^ l urL»o l rii i rrivv* 

PSLVAGASGGASPVGFTPRGGLSPPGHSPGPP 
RTFPSAPPRASGSHOSLLLPPASSPPPPQVPQR 
RGTPPLTPGRLTQDLKUSASQPALPQDGAQT 
LRRASPHSSGESMAAFPLFPRAGGGSGGSGSS 
GGLGPPGRPYGAIPGQHVUPRKTSSGSLPPP 
LSLFGARATSSGGPPLTAGPQREPGARPEPVR 

SKLPSNL 


974 


2324 


A 


8247 


279 


468 


" ■ EYKQWERRFLSCQNRNDLGYGKPRKGUULL 
LVPVKDASRICSLTYLLGSHWNNLWRSPAO- 

G 
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SEQ CD 
NO: of 1 
nucl- 
eotide 
seq- 
uence 


SEQ ED ] 
NO: of 
3eptide 
seq- 
uence 


Mlet 1 ' 
bod ] 

1 


SEQ ] 

[DNO: 

in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

conespondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A«=Alanine OCysteme, 
I>Aspartic Acid, E=Glutamic Acid, 
F«Phenylalamne» G*=Glycine, HNHistidine, 
I^Isoleudne, K=Lysine, IM^cine, 
M^Methionine. N^Asparaglne, P^Proline, 
Q=Glutamine, R=Arginine, S=»Serine, 
T=Thrconine, V=VaUne, W=Tiyptophan, 
Y=Tyn)sine, X=Unknown, ♦=^p codon, 
^^ssible nucleotide deletion, Vpossible 
nucleotide insertion 


973 


2325 


A 


8249 


62 


T371 


LVALKNWKPKOTNIPAPQSFVFGEAVSOVYM 

MTKVLGMAPVLGPRPPQEQVGPLMIVKVEEK 

EEKGKYLPSLEMFRQRFRQFGfYHDTPGPREA 

LSQLRVLCCEWLRPEIHTKEQILELLVLEQFLT 

DLPQELQAWVQEHCPESAEEAVTLLEDLEREL 

DEPGHOVSTPPNEQKPVWEKISSSGTAKESPS 

SMQPQPLETSHKYESWGPLYIQESGEEQEFAQ 

DPRKVRDCRLSTQHEESADEQKGSEAEGLKG 

DUSVIIANKPEASLERQCVNLENEKGTKPPLQ 

EAGSKKGRESVPTKPTPGERRYICAECGKAFS 

NSSNLTKHRRTHTGEKPYVCTKCGKAFSHSS 

NLTLHYRTHLVDRPYDCKCGKAFGQSSDLLK 

HQiyilHTEEAPYQaCDCGKAFSGKOSURHyR 

fflTGEKPYQCNECGKSFSQHAGLSSHQRLHT 

GEKPYKCKECGKAFNHSSNFNKHHRIHTGEK 

PYWCHHCGKTFCSKSNLSKHQRVHTOEGEA 

P 


976 


2326 


A 1 


8257 


298 


7086 


■GNMACWPQLRLLLWKNLTFKRRQlLQLLLb 

VAWPLFffLDLISVRLSYPPYEQHECHFPNKAM 

PSAGTLPWVQGnCNANNPCFRYPTPGEAPGV 

VGNFNKSIVARLFSDARRLLLYSQKDTSMKD 

MRKVLRTLQQIKKSSSNLKLQDFLVDKETFS 

GFLYHNLSLPKSTVDKMLRADVILHKVFLQG 

YQLHLTSLCNGSKSEEMIQLGDQEVSELCGLP 

REKLAAAERVLRSNMDILKPILRTLNSTSPFPS 

KELAEATKTLLHSLGTLAQELFSMRSWSDMR 

QEVMFLTNVNSSSSSTQIYQAVSRIVCGHPEG 

GGLKIKSLNWYEDNNYKALFGGNOTEEDAE 

TFYDNSTTPYCNDLMKNLESSPLSRIIWKALK 

PLLVGKILYTPDTPATRQVMAEVNKTFQELA 

VFHDLEGMWEELSPKIWTFMENSQEMDLVR 

MLLDSRDNDHFWEQQLDGLDWTAQDIVAFL 

AKHPEDVQSSNGSVYTWREAFNETNQAIRTIS 

RFMECVNLNKLEPIATEVWLINKSMELLDER 

KI^AOIVrrOITPGSIELPHHVKYKIRMOIDN 

VERTNKIKDGYWDPGFRADPFEDMRYVWGG 

FAYLQDWEQAIIRVLTGTEKKTGVYMQQMP 

YPCm^DIFLRVMSRSMPUMOAWIYSVAV 

inCGIVYEKEARLKETMRIMGLDNSILWFSWFI 

SSLIPLLVSAGLXWILKLGNIXPYSDPSWFV 

FLSVFAWnLQCFUSTLFSRANLAAACGOn 

YFTLYLPYVLCVAWQDYVGFTLKIFASLLSP 

VAFGFGCEYFALFEEQGIGVQWDNLFESPVE 

EDGFNLTTSVSMMLFDmYGVMTWYIBAVF 

PGQYGIPRPWYFPCTKSYWFGEESDEKSHPGS 

NQKRISEICMEEEPTHLKLGVSIQNLVKVYRD 

GMKVAVDOLALNFYEGQITSFLGHNOAGKT 

TTMSILTGLFPPTSGTAYILGKDIRSEMSTIRQ 

NLGVCPQHNVLFDMLTVEEHIWFYARLKGLS 

^ . nxrcrwAM TW/nj P^^IKT KSKTSOLS 
EKHVKAEMxil^MALilJ V our o or»J-»rkOiw » w» V*'*' 

GGMQRKLSVALAFVGGSKVVBLDEPTAGVDP 
YSRRGWELLLKYRQQRTIILSTHHMDEADVL 
GDRIAnSHGKLCCVGSSLFLKNQLGTGYYLT 
LVKKDVESSLSSCRNSSSTVSYLKKEDSVSQS 
SSDAGLGSDHESDTLTIDVSAISNLIRKHVSEA 
RLVEDIGHELTYVLPYEAAKEGAFVELFHEDD 
DRLSDLGISSYGISETTLEEIFLKVAEESGVDA 
ETSDGTLPARRNRRAFGDKQSCLRPFTEDDA 
ADPNDSDIDPESRETDLLSGMDGKGSYQVKG 
WKLTQQQFVALLWKRLUARRSRKGFFAQIV 
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"seq'id I s 

NO: of 1 
nucl- 1 I 
eotide 2 
seq- 1 
uence 


SEQ ID I 
<10 : of t 
)eptide 
»eq- 
jence 


^et 5 
lod I 
i 
1 

( 
i 


>EQ i 
DNO: I 
n I 
JSSN 
39/496 
?14 


Predicted ] 

jeginning 

lucleotide 

ocation 

sorrespondi 

[ig to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
ocation 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino add sequenoe(A=Alanme u=C7Sieine, 
D=^Aspartic Acid, E^lutamic Acid, 
F«Phenylalanine, G=<jlycine, H'^Histidine, 
Msoleudne, K«Lysine, IMxucinc, 
M=Methionine, N=Asparagine, P=Proiine, 
Q=<jlutamine» R«Arginine, S=Serine. 
T^Threonine, V^Valine^ W«=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
^=possible nucleotide deletion, V=possible 
nucleotide insertion 














LPAVFVCIALVFSUVPPFGKYPSLELQPWMy 

NEQYTFVSNDAPEDTGTLELLNALTKDPOFG 

TRCMEGNPIPDTPCXJAGEEEWTTAPVPQTIM 

DLFQNGNWTMQNPSPACQCSSDKIKKMLPV 

CPPGAGGLPPPQRKQNTADILQDLTGKNISDY 

LVKTYVQEIAKSLKNKIWYNEFRYGGFSLOVS 

NTQALPPSQEVNDATKQMKKHLKLAKDSSA 

DRFLNSLGRFMTGLDTRNNVKVWFNNKGW 

HAISSFLNVINNAILRANLQKGENPSHYGrrAF 

NHPLNLTKQQLSEVAPMTTSVDVLVSICVIFA 

MSFVPASFWFLIQERVSKAKHLQnSGVKPVI 

YWLSNFVWDMCOTVVPATLVniFICFQQKSY 

VSSTNU^NOJOXLIXYGWSITPLMYPASFVFK 

IPSTAYWLTSVNLnGINGSVATFVLELFTDN 

KU«ININDE.ICSVFLIFPHFaGRGLIDMVKNQ 

AMADALERFGENRFVSPLSWDLVGRNLFAM 

AVEGWFFUTVUQYRFFIRPRPVNAKLSPLN 

nTDPnVRRFRORILDGGQONDILEIKELTKIY 

RRKRKPAVDRICVdPPGECFGLLGVNGAGK 

SCTFKMLTGDTTVTRGDAFLNRNSILSNIHEV 

HONMGYCPQFDArrBLLTGREHVEFFALLRG 

VPEKEVGKVGEWAIRKLOLVKYGEKYAGNY 

SGONKRKLSTAMALIGGPPVVFLDEPTTGMD 

PKARRFLWCALSWKEGRSWLTSHSMEEC 

EALCniMAIMVNGRFRCLGSVQHLKNRFGD 

GYTIVVRIAGSNPDLKPVQDFFGLAFPGSVPK 

EKHRNMLQYQLPSSLSSLARIFSILSQSKKRLH 

lEDYSVSQTTLDQVFVNFAKDQSDDDHLKDL 

SimNOTWDVAVLTSFLQDEKVKESYV 


977 


2327 


A 


8260 


3 


1567 


TPGSTISFSLCFIFPPCVPTMVRKPVVSllbK.UU 
YLQGNVNGRLPSLGNKEPPGQEKVQLKRKV 
IXLRGVSfflGTnOAGIFISPKGVLQNTGSVGM 
SLTIWTVCGVLSLFGALSYAELGTTIKKSGGH 
YTYILEVFGPLPAFVRVWVELLIIRPAATAVIS 
LAFGRYILEPFFIQCEIPELAIKLITAVGITWM 
VLNSMSVSWSARIQIFLTFCKLTAILmVPGV 
MQLIKGQTQNFKDAFSGRDSSITRLPLAFYYG 
MYAYAGWFYLNFVTEEVENPEKTIPLAiaSM 
AIVUGYVLINVAYFTTINAEELLLSNAVAVT 
FSERLLGNFSLAVPIFVALSCFGSMNGGVFAV 
SRLFYVASREGHLPEILSMIHVRKHTPLPAVIV 
LHPLTMIMLFSGDLDSLLNFLSFARWLFIGLA 
VAGLIYLRYKCPDMHRPFKVPLFIPALFSFTC 
LFMVALSLYSDPFSTGIGFVITLTGVPAYYLFn 
WDKKPRWFRIMSEKITRTLOIILEVVPEEDKL 


978 


2328 


A 


8261 


2 


" 2165 


" RGGSLRCVLGKLLGQLLCFQSERCVRbPbOLL 
RHRGCGLLSSRLSAGKPPLRTSFFGSWGVLPP 
LADAASMSGVRAVRISIESACEKQVHEVGLD 
GTETYLPPLSMSQNLARLAQRIDFSQGSGSEE 
trcA Ar"rcr;nAnPWPfiAnsSADODDEEGWK 
FQPSLWPWDSVRNNLRSALTEMCVLYDVLSI 
VRDKKFMTLDPVSQDALPPKQNPQTLQLISK 
KKSLAGAAQUXKGAERLTKSVTENQENKLQ 
RDFNSELLRLRQHWKLRKVGDKILGDLSYRS 
AGSLFPHHGTFEVIKNTDLDLDKKIPEDYCPL 
DVQIPSDLEGSAYIKVSIQKQAPDIGDLGTVN 
LFKRPLPKSKPGSPHWQTKLEAAQNVLLCKEI 
FAQLSREAVQKSQVPHIWKNQnSQPFPSLQ 
LSISLCHSSNDKKSQKFATEKQCPEDHLYVLE 
HNIJHLLIREFHKQTLSSIMMPHPASAPFGHKR 
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SEQID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQID 1 
NO: of 1 
peptide 
seq- 
uence 


Viet 
iiod 


5EQ 

[DNO: 

in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

corrcspondi 

ng to first 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alamne OCystelne, 
D=Aspartic Acid, E=Glutamic Acid, 
F=4*henylalanine, G=Glycine, H=Histidine, 
I=Isoleucine, K-Lysine, L«Leucine, 
M«Mcthioninc, N=Asparaginc, P«Proline, 
Q=Glutamine, R»Argmine, S=Serine, 
•MThreonine. V=Valine, W=TTyptophan, 
Y«Tyrosine, X=Unknown, ♦^^top codon, 
/=possible nucleotide deletion, V^ossible 
nucleotide insertion 














MRLSGPQAFDKNBINSLQSSEOLLEKIIKQAK. 

HIRASRAAATIDSLASRIEDPQIQAHWSNIND 

VYESSVKVLrrSQGYEQiaCSIQLQLKIGVEQI 

RWHRDGRVTTLSYQEQELQDFaSQMSQHQ 

VHAVQQLAKVMOWQVLSFSNHVGLGPIESIG 

NASATTVASPSGDYAISVIWGPESGSKIMVQF 

PRNQCKDLPKSDVLQDNKWSHLRGPFKEVQ 

WNKMEGRNFVYKMELLMSALSPCLL 


979 


2329 


A 


8289 


2 




"FVWNPRGGRKRRJRQAAVTQAATRASUi rej' 
RDGTMTQOKLSVANKAPGTEGQQQVHGEKK 
EAPAVPSAPPSYEEATSGEGMKAGAFPPAPTA 
VPLHPSWAYVDPSSSSSYDNGFPTODHELFTT 
FSTODQKVRRVFVRKVYmUQLLVTLAVV 
ALFTFCDPVKDYVQANPGWYWASYAVFFAT 
YLTLACCSGPRRHFPWNLHXTVFTLSMAYLT 
GMLSSYYNTTSVLLCLGITALVCLSVTVFSFQ 
TKFDFTSCQGVLPVIXMTLFFSGLILAILLPFQ 
YVPWLHAVYAALOAGVFTLFLALDTQLLMG 
MRRHSLSPEEYIFGALNIYLDIIYIFIFFLQLFG 

TORE 


980 


2330 


A 


8305 


59 


857 


"ASQLPDYSISPPSLPPRISFHPSFTLARVAMAKF 
SEATQSHSISSSSFGAEPSAPGGGGSPGACPAL 
GTKSCSSSCAVHDUFWRDVKKTQFVFGTTLI 
MLLSLAAFSVISVVSYLILALLSVTISFRIYKSV 
IQAVQKSEEGHPFKAYLDVDITLSSEAFHNY 
MNAAMVHINRALKLIIRLFLVEDLVDSLKLA 
VFMWLMTYVGAVFNGITLLlLAEaiFSVPIV 
YEKYKTQIDHYVGIARDQTKSIVEKIQAKLPG 
lAKKKAE 


981 


2331 


A 


8308 


186 


1337 


TRMSRHEGVSCDACLKGNFRGRRYKCUCYD 

YDLCASCYESGATTTRHTTDHPMQCILTRVD 

FDLYYGGEAFSVEQPQSFTCPYCGKMGYTET 

SLQEHVTSEHAETSTEVICPICAALPGGDPNH 

VTDDFAAHLTLEHRAPRDLDESSGVRHVRR 

MFHPGRGLGGPRARRSNMHFTSSSTGGLSSS 

QSSYSPSNREAMDPIAELLSQLSGVRRSAGGQ 

LNSSGPSASQU5QLQMQLQLERQHAQAARQ 

QI^TARNATRRTNTSSYTTrnX^STATTNlAN 

TESSQQTLQNSQFLLTRLNDPKMSETERQSM 

ESERADRSLFVQELLLSTLVREESSSSDEDDR 

GEMADFGAMGCVDIMPLDVALENLNLKESN 

KGNEPPPPPL 


982 


2332 


A 


8315 


1 


1004 


' "GSTHASADAWAQWFCTBALVMGAPV W YLV 
AAALLVGFILFLTRSRGRAASAGQEPLHNEEL 
AGAGRVAQPGPLEPEEPRAGGRPRRRRDLGS 
RLQAQRRAQRVAWAEADENEEEAVILAQEE 
EGVEKPAETHLSGKIGAKKLRKLEEKQARKA 
QREAEEAEREERKRLESQREAEWKKEEERLR 
T ciTDnvtJTJWFPlf ATiFFOAOREHEEYLKLKEA 
FWEEEGVGETMTEEQSQSFLTEFINYIKQSK 
WLLEDLASQVGLRTQDTmRIQDLLAEGTTT 
GVIDDRGKFIYITPEELAAVANFIRQRGRVSIA 
ELAOASNSLIAWGRESPAQAPA 


983 


2333 


A 


8320 


244 


1420 


■ RRRWRARGGLVPTLAWAEATGAYVPCxKUKi' 
DUamR>nilSAD4RKBGLRLAEDRSKDPHD 
PIKIYEFVNSOVGDFSQPDTSPDTOGGGSTSD 
TQEDILDELLGNMVLAPLPDPGPPSLAVAPEP 
CPQPLRSPSLDNPTPFPNLGPSENPLKRLLVPG 
EEWEFEVTAFYRGRQVFQOTISCPEGLRLVGS 
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SEQID 
NO: of 
nucl- 
eotide 
seq. 
uence 


SEQID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 
beginning 

nucleotide 
location 
coirespondi 
ngto first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 

OvlQ iCSlUUv 

of peptide 
sequence 


Amino acid sequence (A=Alanine OCysteinc, 
D=Aspaitic Acid, B=Olutan)ic Acid, 
F^Phenylalanine, CHllycine^ H«Histidine, 
li^lsoleucine, K^Lysine;, L^Leucine, 
M=Methionine,N=Aspanigine, I^^Piolinc, 
Q=Glutamine, R^Arginine, ^SerinCi 
•MThreonine* V«Valinc» W=Tiyptophan, 
Y«Tyrosinc, X=Unknown, *^top codon, 
Aspossible nucleotide deletion, V=possible 
nucleotide Inseitlon 






1 








" EVGDRTLPGWPVTLPDPOMSLTDRGVMS YV 
RHVLSCLGGOLALWRAGQWLWAQRLGHCH 
TYWAVSEEaPNSOHOroGEVPKDKEGGVF 
DLGPFIVGSLGPPDLirFTEGSGRSPRYALWFC 
VGESWPQDQPWTKRLVMVKVVPTCLRALVE 
MARVGGASSLENTVDLHISNSHPLSLTSDQY 
KAYLQDLVEGMDFQGPGBS 


984 


2334 


A 


8321 


1 


1243 


ANMAPVEHWADAOAFLRHAALQDIOKNIY 

ITREWreiRDKATRRRLAVLPYELRFKEPLPE 

YVRL V IJiFSKKTGDiroLCiA i DU^ VLAL 1 1 KiL, 

EAEFVGVSHLKQEPQKVKVSSSIQHPETPLHIS 

GFHLPYKPKJrrQb 1 JalvunciAUJbriiiNl^tiroor M 

FWRNPLPNIDHELQELLroRGEDVPSEEEEEEE 

NGFEDRKDDSDDDGGGWrrPSNKQIQQELE 

QCDVPEDVRVGCLTTDFAMQNVLLQMGLHV 

I>VNG^^IREARSYIUlCHGCFKrTSDMSRV 

FCSHCGNKTUCKVSVTVSDDGTLHMHFSRNP 

KVLNPRGLRYSLPraCGGKYAINPHLTEDQRF 

PQLRLSQKARQKINVFAPDYIAGVSPFVENDI 

SSRSATLQVRDSTLGAGRRRLNPNASRKKFV 

KKR 


985 


2335 


A 


8322 


352 


529 


RRNNIRQFIMKVCISGQARWLTPWPVLWCT 
EAGRSLELKSLRPAWATWGNPISTKINK 


986 


2336 


A 


8325 


89 


1172 


KMNPTDIAUlTLUIiiSlYSW X xtiinairjvrv^irus 

GKAFGELFLPPLYSLVFVFQLLGNSVWLVL 

FKYKRLRSMTDVYLLNLAISDLLFVFSLPFWG 

YYAADQWVFGLGLCKMISWMYLVGFYSGIF 

FVMU^SIDRYLAIVHAVFSLIlARTLTYGVrrS 

f A Tiirc\7 A \n? A Cf D/im TJCTPVTRPMHTViPirT 
LAT W S VA Vr AolJrljri-r d 1 ^ i i cKPfn 1 1 v*rw i 

KYSLNSTimVI^SIJEINILGLVIPLGIMLFCY 

SKnmXLQHCKNEKKNKAVKMIFAVVVLFLG 

FWTPYNiyLFLETLVELEVLQDCTFERYLDYA 

IQATETLAFVHCCLNPEYFFLGEKFRKYILQL 

FKTCRGLFVLCQYCGLLQIYSADTPSSSYTQS 

TMDHDLHDAL 


987 


2337 


A 


8326 


3 


470 


slsamrflaatflllalstaaqaepvqfkdc 
gsvdgvikevnvspcptqpcx}lsk:gqsysvn 
vtftsniqsksskawhgilmgvpvpfpipbpd 

nrv ^fJTMrPTOKDKTYSYLNKLPVKSEYPSDC 
LWEWQLQDDKNQSLFCWEDPVQIVSHL 


988 


2338 


A 


8335 


1205 


323 


\nVMAT AAUTLPOFLHSRSLPCGAVRLRTFA 

VAEVRLPSATLCYFCRCRLGLGAALFPRSAR 

ALAASALPAQGSRWPVLSSPGLPAAFASFPAC 

PQRSYSTEEia^QHQKTECMIVLGFSNPINWV 

RTRKAFLIWAYFDKEFSITEFSEGAKQAFAH 

VSKLLSQCKFDLLEELVAKEVLHALKEKVTS 

LPDNHKNALAANIDEIVFTSTGDISIYYDEKG 

RKFVNIUICFWYLTSANIPSETLRGASVFQVK 

LGNQNVETKQLLSASYEFQREFTQGVKPDWT 

IARIEHSKLLE . 


989 


2339 


A 


8349 


67 


185 


l^SGJ?lriQLLIQ>ll*CVYHTRLkTSQGLCtLSL 
KSLHPMS 


990 


2340 


A 


8361 


210 


1115 


ASPFLRPQGHDSGEREPFSQTPGLMQPFSIPYQ 

ITLQGSRRRQGRTAFPASGKKRETDYSDGDPL 

DVHKRIJPSSTGEDRAVNILGFAMMGFSVLMF 

FLIX}TTILKPFMLSIQRBESTCTA1HTDD4DDW 

LDCAFTCGVHCHGQGKYPCLQVFVNLSHPG 

QKALLHYNEEAVQINFKCFYrPKCHQDRNDL 

LNSALDKEFFDHKNGTPFSCFYSPASQSEDVI 
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i SEQID ; 
NO: of 1 
nucl- 1 
eotide 

1 seq- 
uence 


SEQID I 
^0: of \ 
)eptide 
seq- 
uence 


Viet 1 J 
lod 1 J 

1 


3EQ ] 
nr\ "Kin* 

n 

USSN 
09/496 
914 


Predicted 

nucleotide 
ocation 
correspondi 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
coiTcsponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Aianine C=Cysteme, 
I>Aspartic Acid, EKjlutamic Acid, 
F«Phenylalanine, GOlycine, H-Histidine, 
I-Isoleudnc, K=Lysinc, l>Lcucinc, 
M=Mcthioninc, N«Asparatfnc P^Proline, 
Q=Ghjtamine, R-Argininc. S»Serin^ 
T=Thrconine, V^Valinc, W-Tiyptophan, 
Y=Tyrosine, X=Unlcnown, ♦=Stop codon, 
/^possible nucleotide deletion, V=possible 
nucleotide insertion 














LQCKYDQMAJFHCLFWPSLTLLGGALIVGMV 
RLTQHLSLLCEKYSTWRDEVGOKVPYEEQH 


991 


2341 


A 


8369 


9 


921 


SSWEFSALSVSMACLSPSQLQKFQQUOFLVL 

EGFLSAEECVAMQQRIOEIVAEMDVPLHCRT 

EFSTQEEEQLRAQGSTDYFLSSGDKIRFFFEK 

GVFDEKGNFLVPPEKSINKIGHALHAHDPVFK 

SrrHSFKVQUARSLGLQMPVWQSMYlFKQP 

HFGGEVSPHQDASFLYTEPLORVLGVWIAVE 

DATLENGCLWFIPGSHTSGVSRRMVRAPVOS 

APGTSFLOSEPARDNSLFVPTPVQRGALVLIH 

PtFVVHKSKONLSDRSRQAYTPHLMEASGTT 

WSPRNWLOPTAELPFPQLYT 


U92 


2342 


A 


8370 


906 


4 


MALSGNCSRYYPREQGSAVPNSFPEWBLNV 

GGQVYFTRHSTLISIPHSLLWKMFSPKRDTAN 

DLATOSKGRFFIDRDGFLFRYILJJYLRDRQYV 

LPDHFPEKGRLKREAEYFQLPDLVKLLTPDEI 

KQSPDEFCHSDFEDASQGSCmUCPPSSLLPAD 

RKWGFITVGYRGSCTLGREGQADAKFRRVPR 

ILVCGRISLAKEVFGETLNESRDPDRAPERYTS 

RFYLKFKHLMGAPASNFILGFWGLOQNQDK 

HPVNIYLQQRSVIRPDLTSKKAGDLKGKGDA 

OKVSRRRRWLGDPEHL 


993 


2343 


A 


8379 


1 


2794 


MRMQRHKNDTMDFGDSGKRIGGGVLCLLHVi 

SNTSFIKLNNNGFEDrVIVIDPSVPEDEKIIEQIE 

DMVTTASTYLFEATEKRFFFKNVSILIPENWK 

ENPQYKRPKHENHKHADVIVAPPTLPGRDEP 

YTKQFTECGEKGEYIHFTPDLLLOKKQNBYG 

PPGKLFVHEWAHLRWGVFDEYNEDQPFYRA 

KSKKIEATRCSAGISGRNRVYKCQGGSCLSRA 

CRIDSTTICLYGKDCQFFPDKVQTCKASIMFM 

QSroSVVEFChffiKTBNQEAPSLQNIKCNFRST 

WEVISNSEDFKKTIPMVTPPPPPVFSLLKIRQRI 

VCLVLDKSGSMGOKDRLNRhflNQAAKHFLLQ 

TVENGSVA^GMVHFDSTATIVNKLIQIKSSDER 

NILMAGLPTYPLGGTSICSGKYAFQVIGELH 

SOLDGSEVULTDGEDNTASSODEVKQSGAI 

VroiALGRAADEAVffiMSKITGGSHFYVSDEA 

ONNGLIDAFGALTSGNTDLSQKSLQLESKOLT 

LNSNAWMhTOTVIIDSTVGKDTmnWNSLPP 

SISLWDPSGTIMENFTYDATSKMAYLSIPGTA 

KVGTWAYNLQAKANPETLTITVTSRAANSSV 

PPnVNAKMNKDVNSFPSPMIVYAEILQGYVP 

VLGANVTAFIESQNGHTEVLELLDNGAGADS 

FKNDGVYSRYFTAYTENGRYSLKVRAHGGA 

NTARLKLRPPLNRAAYIPGWWNGEIEANPP 

RPEIDElWrrLEDFSRTASGGAFWSQVPSL 

PLPDQYPPSQITDLDATVHEDKIILTWTAPGD 

MrnvfiifVORYIIRlSASILDLRDSFDDALQVN 

TTDLSPKEANSKBSFAFKPENISEENATHIFIAI 

KSIDKSNLTSKVSNIAQVTUFIPQANPDDIDPT 

PTPTFIFTPDKSHNSGVNISUVLSVIGSWIV 

NFELsrn 


994 

1 995 


2344 
2345 


A 
A 


8385 
1 8390 


231 
194 


'"644 
3421 


' TNSS^RDHQELNLmHRUSKSQRAVLKIP 
RONPGIFYWIFLPSRSHSASHOSRQRQVSCQG 
TODEIIJKMRNTFABLKNSLEALSSRMDQAEE 
RIGTQAGVQWRDHGSLQPQPPEFKQCFHLSL 

PSSWDYRACLS 
" "AWRKSSVWPR^TRRGEiCSUOUKSbQAJvlKK 
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SEQID £ 
NO: of t 
nucl- 1 
eotide s 
scq- \ 
uence 


;eqid ^ 

40: of h 
)eptide 
>eq- 
lence 


lod I 
i 
\ 
( 
< 


;eq 1 

DNO: 1 
n 1 
JSSN 
)9/496 

m 


Predicted 1 
beginning i 
lucleotide ] 
ocation < 
Mirespondi 
ng to first 
amino acid 
residue of 
peptide 
sequence 


=*rcdictcd end 
lucleotide 
ocation 
Mrresponding 
:o last amino 
acid residue 
of peptide 
sequence 


i\iiiino acid sequence (A^Aianine OCysteinCj 
>Aspartic Acid, &=Glutainic Acid, 
^Phenylalanine, G=GlyciaB, H=Histidine» 
NIsoleucine, K=Lysine, LpLeucine, 
M=Methionine, N=Asparagine» P^ftolinc, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=»Tryptophan, 
Y«Tyrosine, X=Unknown. ♦^Stop codon, 
/^possible nucleotide deletion, \Fposablc 
nucleotide insertion 














DFLSMKQSPALAPEERCRRAGSPKPVLRADJU 

NNMGNGCSQKLATANLLRFLLLVLIPaCALV 

LLLEBLLSYVGTLQKVYFKSNGSEPLVTDGEI 

QGSDVILTNnYNQSTWSTAHPDQHVPAWT 

TDASLPGDQSHRNTSACMNITHSQCQNCLPYH 

ATLTPLLSVVRNMEMEKFLKFFTYLHRLSCY 

QHIMLFGCTLAFPECnDGDDSHGLLPCRSFCE 

AAKEGCESVLGMVNYSWPDFLRCSQFRNQT 

ESSNVSRICFSPQQENGK.QLLCGRGENFLCAS 

GICIPGKLQCNGYNDCDDWSDEAHCNCSENL 

FHCHTGKCLNYSLVCDGYDDCGDLSDEQNC 

DCNPTTEHRCGDGRCIAMBWVCDGDHDCVD 

KSDEVNCSCHSQGLVECRNGQCIPSTFQCDG 

DEDCKDGSDEENCSVIQTSCQEGDQRCLYNP 

CLDSCGGSSLCDPNNSLNNCSQCEPITLELCM 

NLPYNSTSYPNYFGHRTQKEASISWESSLFPA 

LVQTNCYKYLMFFSCTILVPKCDVNTGEHIPP 

CRALCEHSKERCESVLGIVGLQWPEDTDCSQ- 

FPEENSDNQTCLMPDEYVEECSPSHFKCRSGQ 

CVLASRRCDGQADCDDDSDEENCGCKERDL 

WECPSNKQCLKHTVICDQFPDCPDYMDEKN 

CSFCQDDELECANHACVSRDLWCDGEADCS 

»-»rt<-ri-kT->«nr\rvirTT CTXr\/TJQQd?T XrfVHRAATEHH 

DSSDEWDC V 1 Lu-li^ "i^^"^"^"* " ""^^ 
VCADGWQEILSQLACKQMGLGEPSVTKUQE 

OEKEPRWLTLHSNWESLNOmHELLVNGQS 

CESRSKISLLCTKQDCGRRPAARMNKRILGGR 

TSRPGRWPWQCSLQSEPSGHICGCVLIAKKW 

VLTVAHCFEGRENAAmCVVU3INNLDHPS 

VFMQTRFVKTIILHPRYSRAWDYDISIVELSE 

DISETGYVRPVCLPNPEQWLEPDTYCYITGW 

GHMGNKMPFKLQEGEVRUSLEHCQSYFDMK 

TTTTRMICAGYESGrVDSCMGDSGGPLVCEK 

PGGRWTLFGLTSWGSVCFSKVLGPGVYSNYS 

YFVEWKROIYIQTFLLN 


996 


2346 


A 


8392 


199 


3085 


KVILSSEMSKTNKSKSCibKSSRSRSAbKStoRS 

FSKSRSRSRSLSRSRKRRLSSRSRSRSYSPAHN 

RERNHPRVYQNRDFROHNRGYRRPYYFRGR 

NRGFYPWGQYNROGYGNYRSNWQNYRQAY 

SPRRGRSRSRSPKRRSPSPRSRSHSRNSDKSSS 

DRSRRSSSSRSSSNHSRVESSKRKSAKEKKSSS 

KDSRPSQAAGDNQGDEVKEQTFSGGTSQDTK 

ASESSKPWPDATYGTQSASRASAVSELSPRER 

SPALKSPLQSVWRRRSPRPSPVPKPSPPLSST 

SQMGSTLPSGAGYQSGTHQGQFDHGSGSLSP 

SKKSPVGKSPPSTGSTYGSSQKEESAASQGAA 

YTKRYmQKTENGKDKEQKCyrNTOKEKIKE 

KGSFSDTGLODGKMKSDSFAPKTDSEKPFRG 

SOSPKRYKLRDDFEKKMADFHKEEMDDQDK 

DKAKGRKESEFDDEPKFMSKVIGANKNQEEE 

KSGKWEGLVYAPPGKEKQRKTEbLbbiibt'rii 

RSKKEDRQKRSEGGHROFVPEKNFRVTAYK 

AVQEKSSSPPPRKTSESRDKLGAKGDFPTGKS 

SFSITREAQVNVRMDSFDEDLARPSGLLAQER 

KLCRDLVHSNKKEQEFRSIFQHIQSAQSQRSP 

SEU?AQHIVTIVHHVKEHHFGSSGMTLHERFT 

KYLKRGTEQEAAKNKKSPEIHRRIDISPSTFRK 

HGLAHDEMKSPREPOYKAEGKYKDDPVDLR 

LDIERRKKHKERDLKRGKSRESVDSRDSSHSR 

ERSAEKIEKTMCGSKKQKKHRRARDRSR^S 

SSSQSSHSYKAEEYTEETEEREESTTGFDKSRL 
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SEQID J 
NO: of 1 
nucl- ] 
eotide 
seq- 
uenoe 

i 


SEQID 1 
viO: of 1 
peptide 
seq- 
iieace 


Viet J 
lod ] 


3EQ 
[DNO: 

ID 

09/496 
914 


Predicted 

|)eginning 

nucleotide 

ioc&tiou 

coirespondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
r/Yniesnondin& 
to last amino 
acid residue 
of peptide 


Amino acid sequence (A=Aianme OCysteme, 
[>=Aspartic Acid, &=Glutamic Acid, 
F«=Phenylalanine, GKJlydne, H»Histidine, 
I^Isolcudnc, KpLysinc, L=Lcocinc, 
M=Mediionine, N=Asparaginc, P*=Prolm^ 
Q=Glutamlne, R=Arginine. S=Serine, 
T=Threonine, V^Valinc, W«»Tiyptophan, 
Y=Tyiosine, X^Unknown, ♦^top codon, 
/^possible nucleotide deletion, \=T)ossible 
nucleotide insertion 














GTKDFVGPSERGC50RARGTFQFRARGRGWO 
RGNYSGNNNNNSNNDFQKRNREEEWDPEYT 
PKSKKYYLHDDREGEGSDKWVSRGRGRGAF 
PRGRGRFMFRKSSTSFKWAHDKFSGEEGEIE 
DDESGTENRBEKDNIQPTTE 


997 


2347 


A 


8398 


202 


552 


CPALGGRQDLQGTRLLWAHDSGVGGQKAKJS 
KQENLESLEATGREEEGGQGPPVTTTCGVLLA 
LLMAGLALQPGTALLCYSCKAQVSNEDCLQ 
VENCrOLGEOCWTARIREWGDDSRQA 


998 


2348 


A 


8400 


697 


301 


NPPSACTPGSCDSCSGRGRDLAFDSYW5TNW 
MSDPRRPNKVLRYKPPPSECNPALDDPTPDY 
MNLLGMIFSMCGLMLKLKWCAWVAVYCSFl 
SFANSRSSEDTKQMMSSFMLSISAVVMSYLQ 

"VTD/^'DX iTTDD W 


999 


2349 


A 


8401 


93 


1126 • 


ASASHITSGHLRCFPGSEGVGTMARCFSLVLL 

LTSIWTTRLLVQGSLRAEELSIQVSCRIMGnL 

VSKKANQQLNFTEAKEACRLLGLSLAGKDQ 

VETALKASFETCSYGWVGDGFVVISRISPNPK 

CGKNGVGVLIWKVPVSRQFAAYCYNSSDTW 

TNSCIPEnTTKDPIFNTQTATQTTEFIVSDSTYS 

VASPYSTIPAPTTIPPAPASTSIPRRiaCUCVTE 

WMETSTMSTETEPFVENKAAFKNEAAGFGG 

vPTAT T VT ALLFFGAAAGLGFCYVKRYVKAF 

PFimNQQKEMIETTCVVKEEKANDSNPNEES 

KKTDKNPEESKSPSKTTMRCLEAEV 


1000 


2350 


A 


8406 


2 


777 


KERCQFWKPMLSTVGSaQDLQNEDKGlKT 
AAIFTADGNMISASTLMDILLMNDFKLVINKI 
AYDVQCPKREKPSNEHTAEMEHMKSLVHRL 
FTILHLEESQKKREHHLLEKIDHLKEQLQPLE 
QVKAGIEAHSEAKTSGLLWAGLALLSIQGGA 
LAWLTWWVYSWDJMEPVTYFITFANSNfVFF 
AYFrVTRQPYTYSAVKSRQFLQFFHKKSKQQ 
HFDVQQYNKLKEDLAKAKESLKQARHSLCL 

OMOVEELNEKN 


1001 


2351 


A 


8410 


1400 


264 


"VGFWERPLRSSRWFRRSLRRWEMLARAAKU 
TfiALLIJiGSlXASGRAPWlASSGlJPRKrVVLF 
VPQQEAWWERMGRFHRILEPGLNILIPVLDR 
IRYVQSLKEIVINVPEQSAVTLDNVTLQIDGV 
LYLRIMDPYKASYGVEDPEYAVTQLAQTTM 
RSELGKLSLDKVFRERESLNASIVDAINQAAD 
CWGIRCLRYEIKDIHVPPRVKESMQMQVEAE 
RRKRATVLESEGTRESAINVAEGKKQAQILAS 
EAEKAEQINQAAGEASAVLAKAKAKAEAIRI 
LAAALTQHNGDAAASLTVAEQYVSAFSKLA 
KDSNTILLPSNPODVTSMVAQAMGVYGALT 
KAPVPGTPDSLSSGSSRDVQGTOASLDEELDR 

VKMS 


1002 


2352 


A 


8421 


134 


941 


' ■NRENLLESRMMDPCSVGVQLRTTNECHK.nf 

•wwrnnTrmiTTrfrT ACT CGMT^IUfl T T OX PTfiMTLSQ 

YTRHTGFKi JjQbLbbI>il/MLJLiL»^^L»ivi vjivi i i-iuvj 

NNTICFHHVKIYIDRFEDLQKSCCDPFNIHKKL 

AKKNLHVIDLDDATFLSAKFGRQLVPGWCLC 

PKCTQUNGSVDVDTEDRQKRKPESDGRTAK 

ALRSLQFTNPGRQTEFAPETGKREKRRLTKN 

ATAGSDRQVIPAKSKVYDSQGLLIFSGMDLC 

DCLDEDCLGCFYACPACGSTKCOAECRCDRK 

WLYEOIEIEOOEIIHNKHAQ 


1003 


2353 


A . 


8427 


3 


1416 


' "TEWGLSGSCPGCSPLEPOSRGRGAAAWKLLK 
CRRLPEPSPFLTQPNLAQSQPPAPVPVTDPSVT 
MHPAVFLSLPDlACSLLaVTWVFTPVTTErr 
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SEQID 
NO: of 1 
nucl- 
eotide 
seq. 
uence 


SEQID 1 
t^Oiof 
peptide 
seq- 
uence 


Vtet 
tiod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

conrespondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A«Alaiune OKJystcine, 
D=Aspartic Acid, E=<31utainic Acid, 
F^Phenylalanine. G=<jlyciDe; H«=Histidme, 
Msoleucine, K=4.ysine, L«Lcucine> 
M=Mcthionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W-Tiyptophan, 
Y^Tyrosine. X«Unknown, *'=Stop codon, 
/^possible nucleotide deletion, \r=possiblc 
nucleotide insertion 














nTRNmpn nnadvalvnfyadwcrfsqm 

LHPIFEEASDVIKEEFPNENQVVFARVDCDQH 

SDIAQRYWSKYPTLKmiNGMMMKREYRGQ 

RSVKALADYIRQQKSDPIQEIRDLAEITTLDRS 

KRNnOYFEQKDSDNYRVFBRVANILHDDCAF 

LSAFGDVSKPERYSGDNUYKPPGHSAPDMVY 

T ^ AK/fTMpnVTVTsJWTODKCVPLVRErrFENGE 

ELTEEGLPFLILFHMKEDTESLEIFQNEVARQL 

ISEKGTINFLHADCDKFRHPLLHIQKTPADCP 

VIAIDSFRHMYVFGDFKDVLIPGKLKQFVFDL 

HSGKLHREFHHGPDPTDTAPGEQAQDVASSP 

nceccr^vT ADQCVTJVTT T HDRD'EL 
PEobrvlvL^Aroiii 1 iJUJUivi-'isi^xiJ^ 


1004 


2354. 


A 


8432 


910 


387 


GLSRKLRAGFLPGFCRVSPCGSWWETLVKJVl 
ACAAARSPADQDRHCIYPAYLNNKKTIAEGR 
RIPISKAVENPTATEIQDVCSAVGLNVFLEKN 

T^iL*vc<'DT?nrKTD'ri'\/riVUrtT?VRVnT TCOCDGSLC 
KMYSREWNKLJ Vl^ I itOKViv V ^^Lirwv^&L/'JtJA-*' 

LVQFPSRKSVMLYAAEMIPKLKTRTQKTX3GA 
DOSLOOGEGSKKGKGKKKK 


1005 


2355 


A 


8453 


90 


530 


OSHKTKMQSGTHWRVLOLCLLSVGVWGQD 
GNBEMGGITQTPYKVSISGnYILTCPQYPGSE 
ILWQHNDKNIGGDEDDKNIGSDEDHLSLKEF 
OCT cncnvvvrvPRG^TCPEDANFYLYLRARG 
NPGLONRYHRLFREDHSKGHSO 


1006 


2356 


A 


8458 


3 


307 . 


AVQRIRHEMNIFRLTGDLSHLAAIVILLLKl W 

KTRSCAGISGKSQLLFALVFTTRYLDLFTSFIS 

I vrvT'ro\irv'\7'a7V'Anro'NVFHT orTGLWTLNLC 
LYN i^MixV W I AiiiKiN vrni-»x^ ij-rt.i*/v/ 

OLCIFN 


1007 


2357 


A 


8459 


43 


553 


GAGAGGDWAAMDKLKJCVLSGQDTEDRSUL , 

SEWEASSLSWSTRIKGFIACFAIGEX:SLLGT 

VLLWVPRKGLHLFAVFYTFGNIASIGSTIFLM 

GP\^QLKRMFEPTRLIATIMVLLCFALTLCSA 

FWWHNKGLALIFCILQSLALTWYSLSFIPFAR 

DAVKKCFAVCLA 


1008 


2358 


A 


8462 


487 


150 


AQDIRSVHSLGQKSTFVKHtK'rLSHLHGLl'uk' 
TmirwrDDrkT3D QPPQUPr^MPSHRPOIPOLSNSGPS 
DPRWGCVGPSMPTSTCLPGAVEASTTKASLP 
KCPVDSSLPTPEACFL 


1009 


2359 


A 


8465 


134 


954 


ETRVKTSLELLRTQLEPTGTVGNnMTSgi'Vf 
NETIIVLPSNVINFSQAEKPEPINQGQDSLKKH 
1 uAPurvTHTTOnXIGMMVLSLGnLASASFSFN 
FTQVTSTLLNSAYPnGPFFFnSGSLSIATEKRL 
TKLLVHSSLVGSILSALSALVQFnLSVKQATL 
NPASLQCELDKNNIPTOSYVSYFYHDSLYTn) 
Cn"AKASlAGTLSU^CTIXEFCLAVLTAVL 
RWKOAYSDFPGSVLFLPHSYIGNSGMSSKMT 
HDCGYEELLTS 


1010 


2360 


A 


8468 


2 


473 


KYRYRRPYPVMRKICQVGPAGLAFILNISI'VA 
HRVALCHLAGCQEQAAWYHTLQILFFLVSAY 
FFSCPVPEKYFPGSCDIVGHGHQJFHAfi-alU 1 
LSQLEAILLDYQGRQEIFLQRHGPLSVHMACL 
SFFFLAACSAATAALLRHKVKARLTKKDS 


1011 


2361 


A 


8478 


5 


409 


' ■ mSQLEKAHPPADMGRRKSKRKPPPKKKMT 
GTLETQFTCPFCNHEKSCDVKMDRARNTGVI 
SCrVCLEEFQTPrrCILGNLGFFQRVGRGLESG 
PCSSGPLCALVQGQSRPEEQVPPSDFOGVRRC 

RAGFOCO 


1012 


2362 


A 


8481 


2810 


1652 


' iTSTQKWQSVFNDSQEHLERFYCNPENDKM 
RMKYOGQEFWADLNAMNVYETTEFDQLRR 
LSTPPSSNVNSIYHTVWKFFCRDHFGWREYPE 
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SEQID 
I NO: of 

nucl- 
I eotide 
I seq- 
uence 



SEQID 
NO: of 
peptide 
seq- 
uence 



Met 
hod 



2363 



2364 



2365 



SEQ 

I0NO: 

in 

USSN 
09/496 
914 



Predicted 

beginning 

nucleotide 

locatioD 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 



1016 



1017 



2366 



2367 



8488 



8501 



8504 



Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 



363 



8511 



8513 



54 



517 



17 



2190 



453 



1196 



Amino acid sequence (A=Alanine C=Cysteine, 
I>»Aspartic Acid, E^lutamic Acid, 
F=Phenylalanine, GKilycine, H=^Histidine, 
I=Isole«dnc, K«Lysinc, L=Leucinc, 
M=Methiomhc, N=Aspaiagine, P=Proline, 
Q=Glutamine, R^Argininc, S«Serinc, 
T=Thrconine, V^Valinc, W'=Tryptophan, 
Y=Tyrosine, X=Unknown, ♦^Stop codon, 
A^ssible nucleotide deletion, V^possible 

nucleotide insertion . 

SVmUEEANSRGLKEVRFMMWNNHYlLHNS 

FPRREDCWOIJMCFILLPYLQTUjGVPTQAP 

PPLEATSSSQBCPDOVTSANFYPETWVYMHP 

SQDFIQWVSAEDKSYRIIYNLFHKTVPEFKYR 

ILQILRVQNQFLWEKYKRKKEYMNRKMFGR 

DRTOJERHLFHGTSQDVVDGICKHNFDPRVCG 

KHATVfFGQGSYFAKKASYSHNFSKKSSKGV 

HFMFU^VLTGRYTMGSHGMRRPPPVNPGS 

VTSDLYDSCVDNFFEPQIFVTFNDDQSYPYFVI 

OY EEVSNTVSl 

1DENCRTRLRQAWHEVCGNKMAAPIPQGFSCL 
SRFLGWWFRQPVLVTQSAAIVPVRTKKRFrP 
PIYQPKFKTCKEFMQHARKAGLVIPPEKSDRS 
MLACTAGIFDAYVPPEGDARISSLSKEGUER 
TERMKKTMASQVSIRRIKDYDANFKIKDFPE 
KAKDIFIEGSPLY 



"YIRTGYVYICnYAQtMYTYYiKTAYVYIClLY 
AQLNTYITS^YTHSLCIHMYSIRTAYVYICIIY 
AQlMYTYVFYTHRLCfflMYSlRTDYVYICELY 

AOLMY TYVFYTOSYMSDE - 

NSSEHFSQAPQRLSFYSWyOSARLFRFRVPPD 



AVLLRWLLQVSRESGAACTDAEITVHFRSGA 
PPVINPLGTSFPDDTAVQPSFQVGVPLSTTPRS 
NASVNVSHPAPGDWFVAAHLPPSSQKIELKG 
LAPTCAYVFQPBLLVTRWEISIMEPDVPLPQ 
TLI^HPSYLKVFVPDYTRELLLELRDCVSKGS 
LGCPVRLTVGPVTLPSNFQKVLTCTGAPWPC 
RLLLPSPPWDRWLQVTAESLVGPLGTVAFSA 
VAALTACRPRSVTIQPLLQSSQNQSFNASSGL 
LSPSPDHQDLGRSGRVDRSPFCLTNYPVTRED 
MDWSVHFQPLDRVSVRVCSDTPSVMRLRL 
NTGN©SGGSLTISIJ^ANKTEMRNETVVVACV 
NAASPFLGFNTSLNCTTAFFQGYPLSLSAWSR 
RANLnPYPETDNWYLSLQLMCPENAEDCEQ 
AWHVETTLYLVPCLNDCGPYGQCLLLRRHS 
YLYASCSCKAGWRGWSCTDNSTAQTVAQQR 
AATLLLTLSNLMFLAPIAVSVRRFFLVEASVY 
AYTMFFSTPYHACDQPGEAVLCILSYDTLQY 
CDFLGSGAAIWVmCMARLKTVLKYVLFLL 
GTLVIAMSLQLDRRGMWNMLGPCLFAFVIM 
ASMWAYRCGHRRQCYPTSWQRWAFYLLPG 
VSMASVGIAIYTSMMTSDNYYYTHSrWHILX 
AGSAALLLPPPDQPAEPWACSQKFPCHYQIC 
KNDREELYAVT 



KWYPSOPVRIPGRFYYKLPAGHRRCRMAPAK 
KGGEKKKGRSAINEWTREYTINIHKRIHGVG 
FKKRAPRALKEIRKFAMKEMGTPDVRIDTOL 
NKAVWAKGDWVPYRIRVRLSRKRNEDEDSP 
Ts?KLYT LVTYVPVTTFKNLQTVNVDEN 
LERTPASADMAWTKYQLFLAOLMLVTQSlfST 



LSAKWADNFMAEGCGGSKEHSFQHPFLQAV 
GMFLGEFSCLAAFYLLRCRAAGQSDSS\nOPQ 
QPF^^>LIJ^PALCDMTGTSLMYVALNMTSA 
SSFOMLRGAVmTGLFSVAFLGRRLVLSQWL 
GILATlAGLVWGLADLLSKHDSQHKLSEVrr 
GDLLnMAQnVAIQMVLEEKFVYKHNVHPLR 
AVGTEGLFGFVILSLLLVPMYYIPAGSFSGNP 
RGTLEDALDAFCQVGQQPUAVALLGNISSIA 
FF^1FAGISVTKELSATTRMV1X>SLRTVVIWAL 
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SEQ ID 
NU: 01 
nucl- 
eotide 
seq- 
iieoce 


SEQ ID 
NO' nf 

peptide 
seq- 
uence 


Met 
hod 


SEQ 
10 NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

corrcspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of p^itide 
sequence 


Amino acid sequence (A=Alanine C=Cystcine, 
D=Aspartic Acid, &<3iutamic Acid, 
F^Phenylalanine, G=Glycine, H«Histidinc, 
I=Isoleucinc, K-Lysinc, Lf=Lcucinc, 
M=Methionine. N=Aspaiagine. P=Proline, 
Q=<jlutamine» R^Ai^ginine, S=Serine. 
T=Thrconine, V«Valinc, W=Tiyptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possibl6 nucleotide deletion, V=possib!e 
nucleotide insertion 














SLALGWEAFHALQILGFLILLIGTALYNOLHR 
PLLGRLSRGRPLAEESEQERLLGGTRTPINDA 
S 


1018 


2368 


A 


8518 


324 


694 


SPFWTEKRRMEKPLFPLVPLHWFOFOYTALV 

VSGOIVGYVKTGSVPSLAAGLLFGSLAGLGA 

YQLYQDPRNVWGFLAATSVTFVGVMGMRS 

YYYGKFMFVGLIAGASLLMAAKVGVRMLM 

TSD 


1019 


2369 


A 


8526 


2 


1787 


VSAAAVNMEPPDAPAQARGAPRLLLLAVLL 

AAHPDAQAEVRLSVPPLVEVMRGKSVILDCT 

PTGTHDHYMLEWFLTDRSGARPRLASAEMQ 

GSELQVTMHDTRGRSPPYQLDSQGRLVLAEA 

QVGDERDYVCWRAGAAGTAEAAARLNVF 

AKPEATEVSPNKGTLSVMEDSAQEIATSNSRN 

GNPAPKITWYKNGQRLEVPVEMNreGYMTS 

RTVREASOLLSLTSTLYLRLRKDDRDASFHC 

FWVGSPSTPAOWVREGDTVQLLCRGDGSPSP 

EYTLFRLQDEQEEVLNVNLEGNLTLEGVTRG 

QSGTYGCRVEDYDAADDVQLSKTLELRVAY 

LDPLELSEGKVLSLPLNSRAWNCSVHGLPTP 

ALRWTKDSTPLGDGPMLSLSSITFDSNGTYVC 

EASLPTVPVLSRTQNFTLLVQGSPELKTAEIEP 

KADGSWREGDEVTLICSARGHPDPKLSWSQL 

GGSPAEPIPGRQGWVSSSLTLKVTSALSRDGI 

SCEASNPHGNKRHVFHFGTVSPQTSQAGVAV 

MAVAVSVGLLLLWAVFYCVRRKGGPCCRQ 

RREKGAP 


1020 


2370 


A 


8530 


2 


1200 


PRVRLLRPSRSRSCRGLLSTRAPGPSPFRSLHS 

<5P1 1 PHAMlcqpFYRCONTTSVEKGNSAVMOG 

VLFSTGLLGNLLALGLLARSGLGWCSRRPLR 

PLPSVFYMLVCGLTVTDLLGKCLLSPWLAA 

YAQNRSLRVLAPALDNSLCQAFAFFMSFFGL 

SSTLQLLAMALECWLSLOHPFFYRRHITLRLG 

ALVAPWSAFSLAFCALPFMGFGKFVQYCPG 

TWCFIQMVHEEGSLSVLGYSVLYSSLMALLV 

LATVLCNLGAMRNLYAMHRRLQRHPRSCTO 

DCAEPRADGREASPQPLEELDHLLLLALMTV 

LFTMCSLPVIYRAYYGAFKDVKEKNRTSEEA 

EDLRALRFLSVrSIVDPWIFIIFRSPVFRIFFHKI 

FIRPLRYRSRCSNSTNMESSL 


1021 


2371 


A 


8536 


1 


237 


RRGEIDMATEGDVELELETETSGPERPPEKPR 
KHDSGAADLERVTDYAEEKEIQSSNLETAMS 
VIGDRRSREQKAKQER 


1022 


2372 


A 


8537 


94 


541 


■ RKERRRRRRRMEAWFVFSLLDCCALIFLSV 
YFffTLSDUECDYINARSCCSKLNKWVIPELIG 
HTIVTVLLLMSLHWFIFLLNLPVATWNIYRYI 
MVPSGNMGVFDPTEIHNRGQLKSHMBCEAMI 
KLGFHLLCFFMYLYSMILALIND 


1023 


2373 


A 


8540 


26 


431 


"RMMKCPQALLAIFWLLLSWVSSEDKWQSPL 
SLVVHEGDTmNCSYEVTNFRSLLWYKQEK 
KAPTFLFMLTSSGIEKKSGRLSSILDKKELSSIL 
NITATQTGDSAIYLCAVEAQCSLVTCSLYSNS 
TAEALQL 


1024 


2374 


A 


8544 


1731 


743 


GVRLRYSPlAWMVGEAGRDLRRRRAVAVr 
AEKMAVLAPLIALVYSVPRLSRWLAQPYYIX 
SALLSAAFLLVRKLPPLCHOLPTQREDGNPCD . 
FDWREVEIUk4FLSArVMMKNRRSITVEQfflGN 
IFMFSKVANTILFFRLDIRMGLLYrrLCIVFLM 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID "NO: 
in 

USSN 
914 


Predicted 
beginning 
nucleotide 
location 

ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to l&st amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 
D=Aspartic Acid, E^lutamic Acid, 
F«Phenylalaninc, 0«Glycine, H=Histidine, 
I=Isolcucinc, K=Lysine, L»=Lcucine, 
M«Methionine,N«Asparagine, P=Proline, 
Q=<jhitamine, R«Argminc, S«Serine, 
T»Thrconine, V=VaIine, W==Tryptophan, 
Y^Tyrosine, X=Unknown, ♦^Stop codon, 
/"possible nucleotide deletion, \=possiblc 
nucleotide insertion 














TCKPPLYMGPEYIKYFNDKTIDEELERDKRVT 

WIVEFFANWSNDOQSFAPIYADLSLKYNCTG 

LNFGKVDVGRYTDVSTHYKVSTSPLTKQLPT 

LILFQGGKEAMRRPQIDKKGRAVSWTFSEEN 

VIREFNLNELYQRAKKLSKAGDNIPEEQPVAS 

TPTTVSDGENKKDK 


1025 


2375 


A 


8546 


2194 


170/ 


TVSFHKTMASLKCSTVVCVICLEKPKYRCPA 
CRWYCSWCFRKHKEQCNPETOPVEKKIRS 
ALPTKTVKPVENKDDDDSIADFLNSDEBEDR 
VSLQNLKNIXJESATLRSLLLNPHLRQLMVNL 
noftFnKAKLMRAYMOEPLFVEFADCCLGIV 
EPSONEES 


1026 


237^. 


A 


8547 


1078 


594 


VGIVffiLPAVNLKVUXGHWLLTTWGCIWSUS 

YAWANFITLALGVWAVAQRDSIDAISMFLGG 

LLATIFLDIVHISIFYPRVSLTDTGRFGVGMAIL 

SLLLKPLSCO'VYHMYRERGOELLVHTGFLG 

SSQDRSAYQTIDSAEAPADPFAVPEGRSQDAR 

ay 


1027 


2377 


A 


8557 


I 


340 


DFLGPASPQEEGGSESSTMlHLttAMGMIIDV 
FSRYSGSEGSTQTLTICGELKVLMEKELPGFLQ 
SGKDKDAVDKLLKDLDANGDAQVDFSEFIVF 
VAAJTo Al./ JlK I rLiSJWJL^ 


1028 


2378 


A 


8569 


20 


963 


■ KMAATLGPLGSWQQWRRCLSARDGSRKLLL 
LLLLGSGQGPQQVGAGQTFEYLKREHSLSKP 
YQGEAPRPCFUU)WELQVHFKIHGQGKKNL 
HGDQlJaWYTKDRMQPGPVFGNMDKFVOLG 
VFVDTYPNEEKQQERVFPYISAMVNNGSLSY 
DHERDGRPTELGGCTAIVRNLHYDTFLVIRY 
vepm TlMMniDGKHEWRDCIEVPGVRLPRG 
YYFGTSSITGDLSDNHDVISLKLFELTVERTPE 
EEKLHRDVFLPSVDNMKLPEMTAPLPPLSGL 
ALFLIVFFSLVFSVFAIVIGIILYNKWQEQSRK 
RFY 


1029 


2379 


A 


8572 


1 


578 


AAAASHRSRARSRPRRVSSGPAPRRAQSSAG 

RVASGLDSAPLCTh4ARALCRLPRRGLWLLLA 

HHLFMTTACQEANYGALLRELCLTQFQVDM 

EAVGETLWCTWGRTIRSYRELADCTWHMAE 

KLGCFWPNAEVDRFFLAVHGRYFRSCPISGR 

AVRDPPGSILYPFIWPrrVTLLVTALWWQS 

KRTEOrV 


1030 


2380 


A 


8574 


1352 


372 


DSSTVKGGSESRHLCUPDLKGKARTREASbU 

SRTCGRRTSLCTSAKSSWTYRSGRLSWQSIKG 

THLTITQALRQPLHRAPLLPGQLCWSPRPLEK 

NKAMORPLLLPLLLLLQPPAFLQPGGSTGSGP 

mYGVTQPKHLSASMGGSVEIPFSFYYPVVTEL 

AIVFNVRISWRRGHFHGQSFYSTRPPSIHKDY 

VNRLFLNWTEGQESGFUUSNLRKEDQSVYF 

OlVELDTRRSGRQQLQSIKGTKLTrrQAVTTT 

TTWRPSSTTTIAGLRYTESKGnobS WnLoLi^ i 

AIRVALAVAVLKTVE-GLLCLLLLWWRRRKG 

SRAPSSDF 


1031 


2381 


A 


8580 


905 


340 


RRTAGIYPCFPKPGRTRHALCSWLLLLTUQL 

AFDDFQESCAMMWQKYAGSRRSMPLGARIL 

FHGVFYAGGFAIVYYLIQKFHSRALYYKLAV 

EQLQSHPEAQEALGPPLNIHYLKLIDRENFVDI 

VDAKLKIPVSGSKSEGLLYVHSSRGGPFQRW 

HLDEVFLELKDGQQIPVFKLSGENGDEVKKE 


1032 


2382 


A 


8593 


2558 


961 


■ "RRRPRLLPGAEPCEPRVGPRRADMGCSAIIAK 
WAAGAI/3VAGUCAVI/1AVMIVMVPSLIKQ 
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1 SEQ ID : 
NO: of 1 
nucl- ] 
eotide 
seq- 
uence 


SEQBD 1 I 
^0: of 1 
leptide 1 
seq- 
uence 


Viet J 
lod 1 
1 

\ 


SEQ 1 
ID NO: \ 
n 1 
JSSN 
D9/496 
914 


Predicted T 

beginning 

nucleotide 

ocation 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


^edictedend 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Aiiiino acid sequence (A^Aianine OCystemc, 
D=Aspartic Acid, B=Glutamic Acid, 
F«Phenylalanine, CM}lycine, H=Histidine, 
l«Isoleucme, K«Lysine, L^Lcwcinc, 
M=Mcthioninc»N=Asparagine, Ps=Proline, 
Q=Glutaininc, R=Arginine, S=Serine, 
T«Threonine, V^Valine, W=Ttyptophan, 
Y«Tyrosine, X^Unknown, ^^Stop codon, 
/^possible nucleotide deletion, \Fpossible 
nucleotide insertion 














■bVUCNVRTOPSSLSFNMWKEIPIPFYLSVYri'U 

VMNPSEIUCGEKPQVRERGPYVYREFRHKSNI 

TFNNNDTVSFLEYRTFQFQPSKSHGSESDYIV 

MPNILVUSAAVMMENmmiCLlMT^^ 

GERAFMNRTVGEIMWGYKDPLVNLINKYFP 

GMFPFKDKFGLFAELNNSDSCH-FTGFTGVQNI 

SRIHLVDKWNGLSKVDFWHSDQCNMINGTS 

GQMWPPFMIPESSLEFYSPEA(»SMKLMYKE 

sgvfegiptyrfvapichjangsiyppnegfcp 
clesgiqnvstcrfsaplflshphflnadpvl 
aeavtglhpnqeahslfldihpvtgipmnc&v 
klqlslymksvagigqtgfaepwlpllwfa 

ESGAMEGETUinfYTQLVLMPKVMHYAQYV 
LLALGCVUXVPVICQIRSQEKCYLFWSSSKK 
nQifnifKAlOAYSESLMTSAPKGSVLQEAKL 


1033 


2383 


A 


8595 


595 


767 


"AHLPmLLPPHSPTVPTPKSFQCSQKACFSRS 
FaXLSLVSSSLVSLSLCPPLTOA 


1034 


2384 


A 


8597 


640 


164 


VtrsaiPFAFGLGVRASERIAEIDMPYLLK.Yl^ 
PMMQTIGQKYCMDPAVIAGVLSRKSPGDKIL 
VNMGDRTSMVQDPGSQAPTSWISESQVFQTT 
EVLTTRTrELQRRFPTWTPDQYLRGGLCAYSG 
GAGYVRSSQDLSmFO^DVURAKYLKRHG 

F 


1035 


2385 


A 


8603 


936 . 


204 


■ AMASTLEYSPSPLRRLVGPAAGFSRAAKAUL 
SWDPMAFFTGLWGPFTCVSRVLSHHCFSITG 
SLSAIQKMTRVRWDNSALGNSPYHRAPRCI 
HNTfKKNGVGKVGDQIli^GQKKKALIVG 
HCMPGPRMTPRFDSNNWLIEDNONPVGTRI 
KTPIFTSLRKREGEYSKVLAIAQNFV 


1036 


2386 


A 


8606 


1 


562 


PTRAHSFDLCCSPCRIWLLGREEAGEEFrSi'V 

TQYLQPRSPEECKMFACAKLACTPSLIRAGSR 

VAYRPISASVLSRPEASRTGEGSTVFNGAQNG 

VSQLIQREFQTSAISRDIDTAAKnGAOAATyG 

VAGSGAGIGTVFGSLnGYARNPSLKQQLFSY 

AILGFALSEAMGLFCLNtfVAFLILFAM 


1037 


2387 


A 


8615 


2 


2364 


"SPGPSLPESAESLDGSQKUHO'KGSCAtiFli' lUl 
GMVAHINNSRLKAKOVGQHDNAQNFGNQSF 
EELRAACLRKGELFEDPLFPAEPSSLGFKDLO 
PNSKNVQNISWQRPKI)IINNPLnMDOISPTDI 
CQGILGDCWLLAAIGSLTTCPKLLYRWPRG 
OSFKKNYAGIFHFQIWQFGQWVNVWDDRL 
FIKNDKLVFVHSTERSEFWSALLEJCAYAKLS 
GSYEALSGGSTMEGLEDFTGGVAQSFQLQRP 
PONLLRLLRKAVERSSLMOCSEEVTSDSELES 
MTDKMLVRGHAYSVTGLQDVHYRGKMETLI 
RVRNPWGRIEWNGAWSDSAREWEEVASDIQ 
MQIXHKTEDGEFWMSYQDFLNNFTLLEICNL 
TPDTLSGDYKSYWHTTFYEGSWRTGSSAGGC 
T»vTiiDryiT7\UTMPOFKISLPEGDDPEDDAEGNV 
WCTCLVALMQKNWRHARQQGAQLQTIOFV 
LYAVPKEFQNIQDVHLKKEFFTKYQDHGFSEI 
FTNSREVSSQLRLPPGEYniPSTTEPHRDADFL 
LRVFTEKHSESWELDEVNYAEQLQEEKVSED 
DMDQDFLHLFKIVAGEGKEIGVyELQRLLNR 
MAIKFKSFKTKGFGLDACRCMtf4LMDKDGSG 
KLGLLEFiaLWKKLKKWMDIFRECDQDHSGT 
LNSYEhm-VIEKAGIKLNNKVMQVLVARYA 
DDDUIDFDSnSCFliaXIMFTFFLTMDPKNT 
GHICLSLEQVLGEGWEGICRIAPACPSTPPPPS 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQID 
NO: of 
pq^tide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A«AIanine C=Cysteine, 
D=Aspartic Acid, EKjlutamic Acid, 
F«Phenylalanine, GKJIycine, H=Histidtne, 
I=IsoIeucinc, K^Lysinc, LpLcucine, 
M=Methionine, N=Aspaiagine, P^Proline, 
Q=Glutamine, R»Argininc, S=Scrinc. 
T=Threonine, V=Valine, W«Tryptophan, 
Y=Tyiosine, X=Unknown, *=Stop codon, 
/^possible nucleotide deletion, V=possible 
nucleotide inseitiofi 

SDVPGPASCPRLFPPWDLLPVSTVAADDHVGl 


1038 


2388 


A 


8621 


3 


1494 


EAL 

rsrmarapiJgvllllgllgrgvgkneelrly 
hhlfnnydpgsrpvmedtvteslkvtltnl 
islnekeetlttsvwigrowqdyrlnyskddf 
ggietuivpselvwlpeivlennidgqfgvay 

nATJVT VYFfiGSVTWLPPAIYRSVCAVEVTYF 

pfdwqncslifrsqtynaeeveftfavdndg 

ktinkjdidteaytengewaidfcpgvirrhh 

ggatogpgetdyiyslnrwcplfyviniivpcv 

usglvllayflpaqaggqkctvsinvllaqt 

vflfliaqkipetslsvpagrfufvmwa'iu 

vmncvivlnvsqrtptthamsprlrhvllel 

lprllgsppppeapraaspprrassvglllrae 

elilkkprselvfegqrhrqgtwtaafcqsl 

gaaapevrccvdavnfvaestrdqeatgee 

v<5nwvilMONALDNICFWAALVLFSVGSSLIF 
LGAYFNRVPDLPYAPaQP 


t039 


2389 


A 


8636 


1 


900 


"pgrerpggggarrrpqhlpallpserpdgatl 

QAMENELPVPHTSSSACATSSTSGASSSSGCN 

nsssggsgrptgpqisvysgipdrqtvqviqq 
alhrqpstaaqylqqmyaaqqqhlmlqta 
alqqqhlssaqlqslaavqqaslvsnrqgst 

SGSNVSAQAPAQSSSINLAASPAAAQLLNRA 

qsvnsaaasgiaqqavllgntsspaltasqa 
qmylraqmliftptatvatvqpelgtgspar 
pptpaqvqnltlrtqqtpaaaasgptptqpvl 
pslalkptpogsqplptpa 


1040 


2390 


A 


8645 


98 


1388 


■ asqlafggkltstpsrdfqgcgrgavtccsf 
hehrhqsgrclstgmapnlkgrprkkkpcpq 

ppnQircrsWDSNNNSDGKAVAKVKCEARSA 
LTKPKin^CKKVSNEEKFICVAlGEECRADE 

qaflvalykymkerktpieripylqfkqinlw 

tmfqaaqklggyetitarrqwkhiydelgg 

npgstsaatctrrhyerulpyerfikgeedkp 

lppkprkqenssqenenktkvsgtkrikheip 

kskkekenapkpqdaaevsseqekeqbtlisq 

ksipeplpaadmkkkiegyqefsakplasrvd 

pekdnerrdqgsnsekvaeeagekoptpplpsa 

plapekdsalvpoaskqpltspsalvdskqes 

klccftespesepqeasfprlphhtghrwq'm 

mrrrmtncppwqitlptap 


1041 


2391 


A 


8646 


113 


1492 


'LLQEMCTKITPVLWGCFLLWNLYVSSSQTIYP 

gikarn'qraldygvqa(»4km3eqmlkekk 
lpdlsgsesleflkvdyvnynfsnikisafsfp 

>n'SLAFVPGVGIKALTNHGTANlSTDWGFESP 

lfvlynsfaepmeia'ilknlnemlcpiiasevk 
alnanlstlevltkidnytlldyslisspeite 

NYLDLNLKGVFYPLENLTDPPFSPVPFVU'liK 

snsmlyigiaeyffksasfahftagvfnvtls 
teeisnhfvqnsqglgnvlsriaeiyilsqpfm 

VRIMATmiINU}PG>IFTlI)IPASIMMLTQPK 

nstvetivsmdfvastsvglvilgqrlvcsls 
lnrfrlaijesnrsnffivlrfenilssilhfgvl 
planaklqqgfplpnphkflfvnsdievlegf 
llistdlkyetsskqqpsfhvweglniisrqw 

RGK.SAP 


1042 


2392 


A 


8672 


538 


170 


■ ARRIARTRESKAAVSQDNVPALQPGKK1UU.K 
LGGKKKKFKFFRlJKEFKKOUviySPSNFKK^ 
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SEQ ID 
NO: of 
nucl- 
eotide 
seQ- 
uence 


ceo Tn 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 

[DNO: 

in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanme C=Cysteme, 
I>Aspartic Acid, E^lutamicAcid, 
F«Phenylalanine, G=Glycine, H=Histidine, 
I^Isoleucine, K=Lysine, IM^cinc, 
M=Mcdiionine, N=Asparaginc, P=Prolinc, 
Q<fhitainine, R=Arginine, S=Serine. 
T=Thrconine, V^Valine, W=Tiyptophan, 
Y=Tyn)sine, X=Unknown, ♦=Stop codon, 
/^possible nucleotide deletion, \=possible 
nucleotide insertion 














TSLAGNTVQCLNKLKYVIYSAQYPAYUNlli 
LDMrrSTDHVLEQDFWlCFTFYSVKERQI 


1043 


2393 


A 


8688 


359 


17 


GLKTRAPATPTFQREVLGPAKQDMQRRCPRJ 
GLMTSLLKPDCRRWRDYKRWKSGGFTGESC 
HHAmiGDRGGLQODHSHXQWQKRILRTB 
GEPSPKYISKNIFPICSYrrGFL 


1044 


2394 


A 


8718 


292 


1490 


GTVKTSVATPITAGHSCSSGGVLQVKSPATQiS 

GFKFTSKMEDFNMESDSFPDFWKGEDLSNYS 

YSSTLPPFUDAAPCEPESLEINKYFWHYAL 

VFLLSLLGNSLVMLVTLYSRVORSVTDVYLL 

NLALADLLFALTLPlWAASKVNGWIFGmC 

KWSLLKEVNFYSGILLLAaSVDRYLAIVHA 

TRTLTQKRYLVKFICLSIWGLSLLLALPVLLFR 

RTVYSSNVSPACYEDMGNNTANWRMLLRIL 

PQSFGnVPLLIMLFCYGFTLRTLFKAHMGQK 

HRAMRVIFAVVUFIXCWIPYNLVLLADTLM 

RTQVIQETCERRNHIDRALDATEILGILHSCLN 

PLIYAHGQKFRHGIJLiaiAfflGLISKDSLPKDS 

RPSFVGSSSGHTSTTT- 


1045 


2395 


A 


8724 


254 


"3184 


-"FiB^l^AITVANRfeGAQGGll^TCCPPV ILbQ 
DLHRKMHSWMLQTLAFAVTSLVLSCAETIDY 
YOEICDNACPCEEKDGILTVSCENRGIISLSEIS 
PPRFPIYHLLLSGNLLNRLYPNEFVNYTGASEL 
HLGSNVIQDIETGAFHGLRGLRRLHLNNNKL 
ELLRDDTFLGLENLEYLQVDYNYISVIEPNAF 
GKLHIXQVLIU^NLLSSLPNNLFRFVPLTHL 
DLRGNRLKLLPYVGLLQHMDKWELQLEEN 
PWNCSCELISLKDWLDSISYSALVGDWCETP 
FRLHGRDLDEVSKQELCPRRLISDYEMRPQTP 
LSTTGYLHTTPASVNSVATSSSAVYKPPLKPP 
KGTRQPNKPRVRPTSRQPSKDLGYSNYGPSIA 
YQTKSPVPLECPTACSCNLQISDLGLNVNCQE 
RKIESIAELQPKPYNPKKMYLTENYIAVVRRT 
DLLEATOLDLLHLGNNRISMIQDRAFGDLTN 
LRRLYLNGNRIERLSPELFYGLQSLQYLFLQY 
>OREIQSGTFDPVPNLQUJnLNNNLLQAMPS 
GVFSGLTLLRLNLRSNHFTSLPVSGVLDQLKS 
LIQIDLHDNPWDCTCDIVGMiaWVEQLKVG 
VLVDE\aCKAPKKFAETDMRSIKSELLCPDYS 
DVWSTPTPSSIQVPARTSAVTPAVRLNSTGA 
PASLGAGGGASSVPLSVLILSLLLVFIMSVFVA 
AGLFVLVMKRRKKNQSDKTSTNNSDVSSFN 
MQYSVYGGGGGTGGHPHAHVHHRGPALPK 
VKTPAGHVYEYIPHPLGHMCKNPIYRSREGN 

SVEDYKDLHELKVTYSSNHHLQQQQQPPPPP 

QQPQQQPPPQLQLQPGEEERRESHHLRSPAYS 

VSTIEPREDLLSPVQDADRFYRGILEPDKHCST 

TPAGNSLPEYPKFPCSPAAYTFSPNYDLRRPH 

nVT HPnAGDSRLREPVLYSPPSAVFVEPNRNE 

YLELKAKLNVEPDYLEVLEKOTTFSQF 


1046 


2396 


A 


8736 


28 


452 


■ SPSAAGGLAWVSLALGSGSRGRDHSGSClVUl 
AMAGALVRKAADYVRSKDFRDYLMSTHFW 
GPVANWGLPIAAINDMKKSPEnSGRMTFALC 
CYSLTFMRFAYKVQPRNWLLFACHATNBVA 
OLIOGGRLIKHEMTKTASA 


1047 
1048 


2397 
■ 2398 


A 
A 


8741 
8747 


673 
3 


924 
5054 


■ AUK}TPQQTVTLNTDOKVKSFTSPHSNPNLi'i' 
AKFFTSLQSLNWSSHLPPSPATESVGKRGNAK 

PPTTKLLHSSPLWNFFAQQL 

■ PE^^TKPSLSQPTAASPIGSSPSPPVNGGNNAKR 
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SEQID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine 0=Cysteine, 
D=»Aspartic Acid, EMjlutamic Acid, 
F=Phenylalanine, GKjlycine, H=Histidine, 
Hsolcudnc, K«Lysine, L=Lcucine, 
M=Methionine, N=Asparaginc, P=Pioline, 
Q=Glutainine, R=Arginine, S=Serine, 
T^^Threonine, V=Valine, W«Tiyptophan, 
y»Tyrosine, X^Unknown, *=Stop codon, 
^=possible nucleotide deletion, V=possible 
nucleotide insertion 














VAVPNGQPPSAARYMPREVPPRFRCQQDHK ' 

VLLKRGQPPPPSCMXGGOAGPPPCTAPGAN 

PNNAQVTGALLQSESGTAPDSUOGAAASNY 

ANSTWGSGASSNNGTSPNPIHIWDKVIVDGS 

DMEEWPCIASKDTESSSEOTTDNNSASNPGSE 

KSTLPGSTTSNKGKGSQCQSASSGNECNLOV 

WKSDPKAKSVQSSNSrreNNNGLGNWRNVS 

GQDRIGPGSGFSNFNPNSNPSAWPALVQEOTS 

RKGALETDNSNSSAQVSTVGQTSREQQSKME 

NAGVNFWSGREQAQIHNTDGPKNGNTNSL 

NLSSPNPMENKGMPFGMGLGNTSRSTDAPSQ 

STGDRKTGSVGSWGAARGPSGTDTVSGQSNS 

GNNGNNGKEREDSWKGASVQKSTGSKNDS 

WDNNNRSTGGSWNFGPQDSNDNKWGEGNK 

MTSGVSQGEWKQPTGSDELKIGEWSGPNQPN 

SSTGAWDNQKGHPLLENQGNAQAPCWGRSS 

SSTGSEVEGQSTGSNHKAGSSDSHNSGRRSY 

RF3OTDCQAVLQ1XLSRTDLDPRVLSNTGWG 

QTQIKQmVWDIEEVPRPEGKSDKGTEGWES 

AATQIXNSGGWGDAPSQSNQMKSGWGELS 

ASTEWKDPKNTGGWNDYKNNNSSNWGGGR 

PDEKTPSSWNENPSKDQGWGGGRQPNQGWS 

SGKNGWGEEVDQTKNSNWESSASKPVSGWG 

EGGQNEIGTWGNGGNASLASKGOWEDCKRSL 

PAWNETGRQPNSWNKQHQQQQPPQQPPPPQ 

PEASGSWGGPPPPPPGNVRPSNSSWSSGPQPA 

TPKDEEPSGWEEPSPQSISRKMDIDDGTSAWG 

DPNSYNYKNVNLWDKNSQGGPAPREPNLPTP 

MTSKSASDSKSMQDGWGESDGPVTGARHPS 

WEEEEDGGVWNTTGSQGSASSHNSASWGQG 

GKKQMKCSLKGGNNDSWMNPLAKQFSNMG 

LLSQTEDNPSSKMDLSVGSLSDKKJFDVDKRA 

MNLGDFNDIMRKDRSGFRPPNSKDMGTTDS 

GPYFEKGGSHGLFGNSTAQSRGLHTPVQPLN 

SSPSLRAQVPPQFISPQVSASMLKQFPNSGLSP 

GIJNVGPQLSPQQIAMLSQLPQIPQFQLACQL 

LLQQQQQQQLLQNQRKISQAVRQQQEQQLA 

RMVSALQQQQQQQQRQPGMKHSPSHPVGPK 

PHLDNMVPNALNVGLPDLQTKGPIPGYGSGF 

SSGGMDYGMVGGKEAGTESRFKQWTSMME 

GLPSVATQEANMHKNGAIVAPGKTRGOSPY 

NQFDnPGDTLGGHTGPAGDSWLPAKSPPTNK 

IGSKSSNASWPPEFQPGVPWKGIQNIDPESDP 

YVTPGSVLGGTATSPIVDTDHQLLRDNTTGS 

NSSLNTSLPSPGAWPYSASDNSrmVHSTSAK 

FPDYKSTWSPDPIGHNPTHLSNKMWKNHISS 

RNTTPLPRPPPGLTNPKPSSPWSSTAPRSVRG 

WGTQDSRLASASTWSDGGSVRPSYWLVLHN 

LTPQIDGSTLRTICMQHGPLLTFHLNLTQGTA 

LIRYSTKQEAAKAQTALliMl^vjLUiNi lU-Anr 

ATDDEVSRFLAQAQPPTPAATPSAPAAGWQS 

LETGQNQSDPVGPALNLFGGSTGLGQWSSSA 

GGSSGADLAGASLWGPPNYSSSLWOVPTVED 

PHRMGSPAPLLPGDLLGGGSDSI 


1049 


2399 


A 


8748 


200 


1387 


' VPWKRQDEQLSLQVETLYLUSPAVIHLLSPTF 
LPPSSLPPFLQJVDSSSSACTLDSFFPFLAPWDS 
PQDCGFKDHQPLTLQALTVELARWTLMLLLS 
TAMYGAHAPLLALCHVDGRVPFRPSSAVLLT 
ELTKLLLCAFSLLVGWQAWPQGPPPWRQAA 
PFALSALLYGANNNLVIYLQRYMDPSTYQVL 
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SEQID 
NO: of 
nucl- 
eotide 
seq. 
uence 


SEQID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 


Predicted 

beginning 

nucleotide 

location 

correspondi 

nt? to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, 
I>=Aspartic Acid, E=Glutaniic Acid, 
F=Plienyialanine, G=Glycinc, H«Histidine, 
Msolcucine, KHLysine, Lr»Lcucinc, 
M=Mctiiioninc, N=Asparaginc, P^Proline, 
Q=Glutaminc, R=Argininc, S=Serine, 
T^Threonine, V=VaIine, W=Tryptophan, 
Y~Tyrosine, XKJnknown, ♦=5top codon, 
/=^ssible nucleotide deletion, V^osslble 
nucleotide insertion 














SNLKIGSTAVLYCLCLRHRLSVRQGLALLLL 

MAAGACYAAGOLQVPGNTLFSPPPAAAASP 

MPLHIXl^GUXLILYCLISGLSSVYTELLMKR 

QRLPLALQNLFLYTFGVLLNLGLHAGGGSGP 

GLLEGFSGWAALWLSQALNGLLMSAVMKH 

GSSITRLFVVSCSLVVNAVLSAVLLRLQLTAA 

FFLATLUGLAMRLYYGSR 


1050 


2400 


A 


8758 


3 


1660 


WVSSMGFEELLEQVGGFGPFQLRNVALLALP 

RVLLPLHFLLPIFLAAVPAHRCALPGAPANFS 

HQDVWLEAHLPREPDGTLSSCLRFAYPQALP 

NTTLGEERQSRGELEDEPATVPCSQOWEYDH 

SEFSSTIATESQWDLVCEQKGLNRAASTFFFA 

GVLVGAVAFGYLSDRFGRRRLLLVAYVSTLV 

T fii AQAAQV^WMFAITRTLTGSALAGFTnV 

MPLELEWLDVEHRTVAGVLSSTFWTGGVML 

LALVGYLIRDWRWLLLAVTLPCAPGILSLW 

VPESARWLLTQGHVKEAHRYLLHCARLNGR 

PVCEDSFSQEAVSKVAAGERWRRPSYLDLF 

RTPRLRHISLCCVWWFGVNFSYYGLSLDVS 

GLGLNVYQTQLLFGAVELPSKLLVYLSVRYA 

GRRLTQAGTLLGTALAFGTRLLVSSDMKSWS 

TVLAVMGKAFSEAAFTTAYLFTSELYPTVLR 

QTGMGLTALVGRLOGSLAPLAAIXDGVWLS 

LPKLTYGGIALLAAGTALLLPETRQAQLPEn 

QDVERKSAFTSLQEEEMPMKQVQN 


1051 


2401 


A 


8759 


515 


1625 


EEDGGWKVEKELENTEQPVGGNEWEHEV 

TGNLNSDPLLELCQCPLCQLDCGSREQLIAHV 

YQHTAAWSAKSYMCPVCGRALSSPOSLGR 

HLLIHSEDQRSNCAVCGARFTSHATFNSEKLP 

EVLNMESLPTVHNEGPSSAEGKDIAFSPPVYP 

AGBLLVCNNCAAYRKLLEAQTPSVRKWALRR 

QNEPLEVRLQRLERERTAKKSRRDNETPEERE 

VRRMRDREAKRLQRMQETDEQRARRLQRDR 

EAMRLKRANETPEKRQARLIREREAKRLKRR 

LEKMDMMLRAQFGQDPSAMAALAAEMNFF 

QLPVSGVELDSQLLGKMAFEEQNSSSLH 


1052 


2402 


A 


8763 




7n 
t\f 


RHGHGGRDRRGGGRVARPGGLGRYPGRGAA 

ASLVFVPTRRRSGPSGTASVAAMAYHSGYGA 

HGSKHRARAAPDPPPLFDDTSGOYSSQPGGY 

PATGADVAFSVNHLLGDPMANVAMAYGSSl 

ASHGKDMVHKELHRFVSVSKLKYFFAVDTA 

YVAKKLGLLVFPYTHQNWEVQYSRDAPLPP 

RQDLNAPDLYIFTMAFITYVLLAGMALGIQK 

RFSPEVLGLCASTALVWWMEVLALLLGLYL 

ATVRSDLSTFHLLAYSGYKYVGMILSVLTGL 

LFGSDGYYVALAWTSSALMYFIVRSLRTAAL 

GPDSMGGPVPRQRLQLYLTLGAAAFQPLIIY 

WLTFHLVR 


1053 


2403 


A 


8768 


2 


712 


RPPRVWYPELRELSAAAPRWSHRTAPGIMVF 

YFrSSSYNSSAYTIYMGICDKYENEDLIKHOW 

PEDIWFHVDKLSSAHVYLRLHKGENIEDIPKE 

\a.MDCAHLVKANSIQGCKMNhm^VVY^ 

SNLKKTADNIDVGQIGFHRQKDVKIVTVEKK 

VNEILNRLEKTKVERFPDLAAEKECRDREER 

NEKKAQIQEMKKREKEEMKKKREMDELRSY 

SSLMKVENMSSNQDGNDSDEFM 


1054 


2404 


A 


8769 


344 


327 


REATTLACRNSCWVFSRCSLGACKPTVCSMP 
SLSRQGSQTXCLRLAEYCMESVDSQRLLLS 
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SEQID 
NO: of 
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seq- 
uence 


Miet 
bod 


SEQ 

©NO: 

n 

USSN 
09/496 
914' 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
add residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C>=Cystcine, 
l>Aspartic Acid, EpGlutamic Acid, 
F^Phenylalanine, Q=01ycine, H=Histidine» 
Msoleucine, K-Lysinc, L»=Leucinc, 
M-Methionine, N=Asparagine» P^Prolinc, 
Q=Glutaminc. R=Arginine, S=Scrine, 
T^Threonine, V=Valine, W=Tryptophan, 
Y«Tyrosinc, X-Unknown, *«StDp codon, 
possible nucleotide deletion, V^ssible 
nucleotide insertion 


10S5 


2405 


A 


8770 


430 


1104 

332 


QQESPAAGAARMNCKEGTDSSCGCRGNDEK 
KMLKCVWGDGAVGKTCLLMSYANDAFPEE 
YVPTVFDHYAVTVTVGGKQHLLGLYDTAGQ 
EDYNQLRPLSYPNTDVFLICFSVVNPASYHNV 
QEEWVPELKDCMPHVPYVUGTQIDLRDDPK 
TLAW-LYMKEKPLTYEHGVKLAKAIGAQCYL 
ECSALTQKGLKAVFDEAILTIFHPKKKKKRCS 
EGHSCCSn 

NPRIQLSGNSCCAGSCRVWLSJig 


1056 
1057 


2406 
2407 


A 
A 


8773 
8778 


261 
3 


477 


■ PAGIRHEQARGADRMGKCRGLRTARKLRSH 
RRDQKWHDKQYKKAHLGTALKANPFGGAS 
IT A V r«TV/T cir VO VP A K OPMS AIRKCVRVOLDC 
NGKKITAFVPNDGCLNFIEENDEVLVAGFGR 
KGHAVGDIPGVRFKWKVAKVSLLALYKOK 
KERPl^S 


1058 


2408 


A 


8808 


171 


881 


PGLSQEPSGSMETVVIVAIGVLATlFLASl'AAL 
VLVCRQRYCRPRDLLQRYDSKPIVDUGAME 
TQSEPSELEUDDVVrrNPHIEAILENEDWIEDA 
SGLMSHCIAILKJCmTEia.VAMTMGSGAK 
MKTSASVSDUWAKRISPRVDDWKSMYFPL 
DPKLLDARTTALLLSVSHLVLVTRNACmTG 
GLDWIDQSLSAAEEHLEVLREAALASEPDKG 
LPGPEGFLOEQSAI 


1059 


2409 


A 


8809 


246 


757 
381 


MRLQGAIFVLLPHLGPILVWLhTRDHMSGWC 

EGPRMLSWCPFYKVLLLVQTAIYSWOYASY 

LVWKDLGGGLGWPLALPLGLYAVQLTISWT 

VLVLFFTVHNPGLAIXmXLLYGLWSTALI 

wuPTMifT A Af T I LPYLAWLTVTSALTYHLWR 

DSLCPVHQPQPTEKSD 

PKLSVYPLOSHHCLSEPFQSLVCCLA 


1060 
1061 


2410 
2411 


A 
A 


8810 
8820 


304 
1673 


848 


SCKTENLLEMWWFQQOLSFLPSALVIWTSAA 

FIFSYITAVTLHHIDPALPyiSDTGTVAPEKCLF 

GAMLNIAAVLCIATIYVRYKQVHALSPEENVI 

IKLNKAGLVLGILSCLGLSIVANFQKTTLFAA 

HVSGA\a,TFGMGSLYMFVQTILSYQMQPKIH 

GKQVFWIRLLLVIWCGVSALSMLTCSSVLHS 

fUJFriTnT l^ia.HVW»n*HDKGYVLHMnTAAE 

WSMSFSFFGFFLTYIRDFQKISLRVEANLHGL 

TLYDTAPCPINNERTRIXSRDI 


1062 


2412 


A 


8824 


1 


763 


GGAPPASVPARESPVSGAQGSSRTRGHIUIAA 

GARAPQLCSSWQRRSAPAMSRGLQLLLLSCA 

YSLAPATPEVKVACSEDVDLPCTAPWDPQVP 

YTVSWVKIJLEGGEERMETPQEDHLRGQHYH 

QKGQNGSFIMPNERPYSUCIRNTrSCNSGTYR 

CTLQDPDGQRl^SGKVILRVTGCPAQRKEET 

FKKYRAEIVLIXALVIFYLTUIFraCFARLQSI 

FPDFSKAGMERAFLPVTSPNKHLGLVTPHKT 

ELV 


1063 


2413 


A' 


8826 


147 


627 


' CETSTSSAGHAPaUiAAQGPPAEPTGLllLCbb 
HQRLHAWPPGPRRPSLWPPKNGKWHSGKRT 
AGGRPQRRPSRRQSQRPSAWSGSPRMHSPGQ 
KCSLMCPHRSQDSLSTAIFQRSPGANTGRALH 
CVI^KEMKSVQRS1X5I^RIHU)SKRKIIHFVL 
TR 


1064 


2414 


A 


8835 


2982 


1869 


■ LKDTLKSQNn'QEASDEAEDMKEAMNRMlUlS 
LNKQVSELSQLYKEAQAELEDYRKRKSLEDV 
TAEYIHKAEHEKLMQLTNVSRAKAEDALSE 
MKSQYSKVLNELTQLKQLVDAQKENSVSITE 
HLQVnTLRTAAKEMBEiaSNLKEHLASKEVE 
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Amino acid sequence (A=AIaninc C=Cysteine, 
D=Aspartic Acid, E^Glutamic Acid, 
F=Phenylalaiunc, GKilycine, H'=Histidine, 
I^Isoleucine, K^Lysine, L^L^cinc, 
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T=Threonine, V=Valine, W=Tiyptophan, 
Y=Tyrosine, X=Unknown, »=Stop codon, 
/^possible nucleotide deletion, N^possible 
nucleotide insertion 




• 










VAKLEKQLI^EKAAMTDAMVPRSSYEKLQS 

SUSSEVSVLASKLKESVKEKEKVHSEVVQIRS 

EVSQVKREKENIQTLLKSKEQEVNELLQKFQ 

QAQEELAEMKRYSESSSKLEEDKDKKINEMS 

KEVTKLKEALNSLSQLSYSTSSSKRQSQQLEA 

LQQQVKQLQNQLAECKKQHQEVISVYRMHL 

LYAVQGQMDEDVQKVLKQILTMCKNQSQK 




2415 


A 


8841 


3 


663 


"AaataaslsprgCrlrtpssdvgpsrapppsa 

APLFTGRAQMSPSGRLCLLTIVGLILPTRGQTL 

kdttssssadatimdiqvptrapdavytelqp 

TSPTFTWPADETPQPQTQTQQLEGTDGPLVT 

dpethbcstkaahptddtttlserpspstdvqt 
dpqtlkpsgfheddpffydehtlrkrgllva 

AVLFTTGfflLTSGKCRQLSRLCRNHCR 


1066 


2416 


A 


8853 


3806 


2204 


FVGEQEGGCaBAGAGRGAQmGEAGERWl-Cj 

RRRRRGRWSRKKMSLKSERRQIHVDQSDLL 

CKKGCGYYGNPAWQGFCSKCWREEYHKAR 

QKQIQEDWELAERLQREEEEAFASSQSSQGA 

QSLTFSKFEBKKTNEKTRKVTTVKKFFSASSR 

VGSKKEIQEAKAPSPSINRQTSIETDRVSKEFIE 

FLKTHIKTGQEIYKQTKLFLEGMHYKRDLSIE 

EQSECAQDFYHNVAERMQTRGKVPPERVEKI 

NTOQIEKYIMTRLYKYVFCPETTDDEKKDLAI 

qkriralrwvtpqmlcvpvnedipevsdmw 
kattdiiemdskrvprdklacitkcskhifnai 
kitknepasaddflptliyivlkgnpprlqsni 
qytmcnpsrlnn'gedgyyftnlccavafie 

KLDAQSLNLSQEDFDRYMSGQTSPRKQEAES 
WSPDACLGVKQMYKNLDLLSQLNERQERIM 

neakklekdlidwtdgiarevqdivhcyplei 

KPPNOPLAAIDSENVENDKLPPPLQPQVYAG 


10^7 
1068 


2417 
2418 


A 
A 


8855 
8856 


1372 
1530 


1513 
1583 


"SNMREVGCGWLVPVIPAPWEAEVGGSLKAKb 
LRQAWATKODPISKKK 
PCRPGMECNSMISVHCNL 


1069 
1070 


2419 
2420 


A 
A 


1 8857 
8866 


1530 
293 


1583 
1675 


PCRPGMECNSMISVHCNL 

PYPQGGYPQGPYPQEGYPQGPYPQGGYPQGi' 

YPQSPFPPNPYGQPQVFPOQDPDSPQHGNYQ 

EEGPPSYYDNQDFPATNWDDKSIRQAFIRKVF 

LVLTLQI^VTLSTVSVFTTVAEVKGFVRENV 

WTYYVSYAVFFISLIVLSCCGDFRRKHPWNL 

VALSVLTASLSYMVGMIASFYNTEAVIMAVG 

ITTAVCFTVVIFSMQTRYDFTSCMGVLLVSM 

WLFIFAILCIFIRNRILEIVYASLGALLFTCFLA 

VDTQLLLGNKQLSLSPEEYVFAALNLYTDIINI 

FLYILTnGRAKE*PSSSSLCPLRWHGWPGPCP 

WHGSASCTSPLSCPQAQPREKDASLQPSCMY 

TADTSIWTRCGHSMAPLVLPPPPRGTKATFPC 

HT T <rrHCCMSPVCOPTPGTGGSTRSRGEGLSQ 

EVRVHVFPPVPAPQPGVEHPSPPPHPPOVLPS 

GDMRSGGUPVLSPE 


1071 


2421 


A 


8868 


2 


358 


■ ARGNTLYHLPRLCRKLNLRWFSAS PLYD VQH 
DDKMGSNTFFKRNDCRYVMISCKADMAYDN 
VRHPFMI*SI\KLIMEETYLNIIKAVYDRPTASn 
T,NfiRKLKVFPVRSGT*QGCSVWP 


1072 


2422 


A 


8870 


33 


658 


" ■MESVLSKYEDQrm'1'DVLbEYPDTDELVWiL 
GKQHLLKTEKSKLLSDISARLWFTYRRKFSPl 
GGTGPSSDAGWGCMLRCGQMMLAQALICRH 
LGRDWSWEKOKE0PKEYQR3LOCFLDRKDC 
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Amino acid sequence (A=Alanine C=Cysteine, 
D=Aspartic Acid, E^lutamic Acid, 
F«Phenylalanine, G«Glycine, H^Histidine, 
Hsoleudne, KpLysinc, L^Lcucinc, 
M=Mcthionine,N=A^>aragine. P^Proline, 
Q=<31utainine, R«Arginine, S=Scrine; 
T«Threomne, V=Valinc, W-Tiyptophan, 
Y«Tyiosine> X=Unknovwi, *'^top codon, 
^=posstble nucleotide deletion, V=possible 
nucleotide insertion 














"CTSIHQMAQMOVQEGKSIOEWVLGPNTVXAQ 
GV*KNIA\IJFDEWWSLGLVYVSM\DNPSGSIA 
RFPKKLCRVLPL\SADTAGLTGP 


1073 


2423 


A 


8879 


146 


412 


DFSV*GDVDIEVTCPICLqLLTEPLSLNCGLRL 
♦QVCITA*IKESVnSGG*SSSPVCHnFQPANL 
RTSRYLPT*SIKSLGPDEPQEG 


1074 


2424 


A 


8884 


67 


435 


HLQGRSIRTLQLl'GKNlKNCEVSEKIKkSiii'W 
KEISFGDYICHTFQGDCWADRSPLHEAAAHG 
RJXALKTLIAQGVNVNLWTL/DRVSSLHEACL 
♦GPVACAKPYWKMVPRHGOTVTGPPLLMV 
r»or>KbxTr'T Tuni nnT ^OR^RNOSYRSRSRSR 


1075 


2425 


A 


8896 


1294 


"248 


SRERPSAPRGIPFASASSSVYYGSYSRPYGSDK 

PWPSLLDKEREESLRQKRLSERERIGELGAPE 

VWGLSPKNPEPDSDEHTPVEDEEPKKSTTSAS 

TSEEEKJKKKSSRSKERSKKRRKKKSSKRKHK 

KYSEDSDSDSDSETDSSDEDNKRRAKKAKKK 

EKKKKHRSKKYKKKRSKKSRKESSDSSSKES 

r\trt?T7i iTVTDWlfmJTKAFFP^DLlGPEAPKTLTS 

QDDKPLNYGHALLPGEGAAMAEYVKAGKRI 
PRRGEIGLTR*RNCHHLNAQVM**VVSRHRR 
MEAVRTAKREPESTVLMRREPLHPFNPRRET 

IvJbKn 


1076 


2426 


A 


8899 


146 


789 


GRSTEAEKEPAFDERTGKORRLPRAGEFHG'b 

*APGPGPRSFQVSRKMPEE«>PGARKHPFSGiCS 

rvT m PAaiOOT^OFLTGAIOOLGGVIEGFLSKE 

VSYIVSSRREVKAESSGECSHRGCPSPSPSEVR 

VETSAMVDPKGSHPRPSRKPVDSVPLSRGKE 

1 T nKAIRNOK**CTVQQLSHCRLY\OE!CrTAK 

RSOREHVQQQSQEHGKWPDLKGPR 


1077 


2427 


A 


8901 


352 


3 


AKIGAYKYIQELWRKKQSDVMHFLLRVRCW 
QYPALHRAGTEWQLSALHRAPRSTQPDKAC 
RLGYKAKQGYHYRICVRRGGWKCPVPKAVT 
\YGKPVHHGVN*LKFAQSLOSVAEEQ 


1078 


2428 


A 


8905 


536 


781 


ACPAENREVPEMAAGQAPHAGPGAGPGQPA 
PALPFAATPGSRGQALCRGGRRRQHLHGPLH 
RP*0AAPALHAGCX3LAPHPPT 


1079 


2429 


A 


8912 


121 


376 • 


"NLIWKLCVTERRLVILDNYDLASBTEANKYl 
CNRnQFKPGQDBCYFrLGLPTGSTPL*CYPKLI 
EYNKNGHLSFKYVK1FSMDEY 


1080 


2430 


A 


8920 


381 


1788 


-SSESPSDPGRMAMTWIVFSLWPLTVFMGHIU 
GHSLFSCEPITURMCQDIJYNrTFMPNLLNHY 
IXJQTAALAMEPFHPMVNLDCSRDFRPFLCAL 
YAPTrMEYGRVTLPCRRLOQRAYSECSKLME 
MFGVPWPEDMECSRFPDCDEPYPRLVDLNLA 
GEFTEGAPVAVQRDYGFWCPRELKIDPDLGY 
SFLHVRDCSPPCPNMYFKREBLSFARYFIGLIS 
nCLSATLFTFVTFLIDVTRFRYPERPIKCYAV 
WHMMVSLIFFMGFLLEDRVACNANSIPAQYKA 
STVTQGSHNKACTMLrMU* I rr 1 MAuo v w w 
VILTITVmJ^WKWGSEAIEKKALLFHASA 
WGIPGTLTIILLAMNKIEGDNISGVCFVGLYD 
VDALRYFVLAPLCLYVWGVSLLLAGnSLNR 
VRIEIPL*KENQDKLVKrMIRIGVFSILYLVPLL 
WIGCYFYEOAYRGIWETTWIQERC 


1081 
1082 


2431 
2432 


A 
A 


8922 
8923 


56 
355 


420 
1079 


" EERTKMSTGPDVKATVGDlSSDGNLNVAQEb 
CSRKGIVDEFFPLLSN^CIWTQPQGYPQSSYG 
TLANFVF\CSVRHGLALILQLCNFSIYTQQMN 
LSIAIPAMVNNTAPPSQPNASTBRPST 

- ■ pFGTFSSTMAVVKNKCLhflKGGKKOVKKKV V 
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I=lsoleucine, K=Lysinc, L=Leucine, 
M=Methionine, N«Asparaginc, P=»Prolinc, 
Q=Olutainine, Kr-/vrginine, o-ocruic, 
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Y=Tyrosine, X=Unknown, ♦=Stop codon, 
A=possible nucleotide deletion, >Fpossible 
nucleotide insertion 














GPFSKmQYDVKAPAMFNIRNTGKnXVART 

FKUTEDVQDKNCLTT^GMDLTCDKICSMV 

EKWSTMIEAHVDVKTTOGYFFHLFCVGFTKK 

HNNQDLKTSyA*HQQS/RQIQKKMMEIMT*EV 

QTNDLKEWNKLIPDNIGKDTEKV/CPIYPLH 

DWIRKVKMLENrurliKXivUiuKUuuooo 


1083 


2433 


A 


8948 


28 


385 


iTWPQPHffSCPAMSEETLQSKLAAAKKKLP 
WG AVQGSRAMSDLLLLLLJJL 1 LLJ-LLMU-r w 
AGYSGQLAGVAVSAGSPPI/RYKFHVEPYGET 
GWLLT/ESCSISPKLCSIAVH*DNPAWF 


1084 


2434 


A 


8950 


156 


318 


HYTl'lJNrui UiNSENNKCW'vj I 'b\viji.iHl J w 
WGGKRVQPFWKRVWQKRTLNLRV 


lois 


2435 


A 


8956 


16 


413 


HMGQLGYFIQCWWECKRLISF\WKTI*QSPAK 

*TIYTSYDTAIPIS/GI/YPKRMSSKCHQETCAR 

MFILAPFTATKGKQLTCPLVEERIDYMVIWYS 

HKYYIKVKRNL*VTITH\TWVNLNILMFBI1^ 

YSHKYY 


1086 


2436 


A 


8962 


868 


1026 


H*KIU3VGRAQRAHXSRL*SQLLRRLRHESHL 
NPGARGCSEARLHRCTPAWTT 


1087 


2437 


A 


8985 


58 


330 


LHVKHLGHFQLVFSEV1CHCILMPVS*ELQRL 
♦ERSVCAFHVCIQTYVCLQVYACMCVYYICM 
FVYSVYGCGLCTCVCMDVYICVCVQEFL 


1088 


2438 


A 


89.89 


394 


404 


N*KWILHVNVR1QSIFF/IKRNQ*;/UM bWtiLJfii/Li 
KKPLDMMSNA*STKKHDKLD/LIKFKTyLCSA 
KYTVKRIKIHPTDLEKMLRNHLSDKD*YS/GV 
YKDLSFXNRJRKTE/S*/VKKWVKDLSRYFIKE 
VISMENKHKKIFSTS 


1089 


2439 


A 


8991 


60 


329 


MALTPESPSSFPGLAATGSSVPEPPGGPNA 1 L 
NSSWDSPTEPSSLEDLEATGTIGTLLSDMGW 
GVEDNAYTLEvNbRYMKAVulM'iiiL* 


1090 


2440 


A 


8996 


2 


351 


SNITITLT*MKKYDNTFCW*QCGQIG/T/UYC 
WQESKFIQAFWSKIQQYLAnSIHILFDPAFLFL 
GGYPGGTQSVFLTGVLVSSVFYmiKMLHTR 
LLIAALFUVQYWKQSKDHYI 


1091 


2441 


A 


8997 


97 


456 


YPLPVCSYLSGPRGEHWNSLGGKSSCPLPLPr 
LVSSRFKISKVIWGDLSVGKTCLINR*GGAG 
AELGRVGPSLARWAGSRSQHLVPSQWCKDS 
FDKNYKAPIGADFEMERFEVLGIPF 


1092 


2442 


A 


8999 


548 


811 


SSFIKRHILIFEDDWHQrrCCHHPHHPW»RUQ 
FHIFYVSVQNSISPSLSVSSSHPDRPDHEVHQH 
RAAHHHOHGQGPLGHGLVARVG 


1093 


2443 


A 


9002 


3 


2745 


ALLGLQQPAQSLILSRSSVMGVRGLQGFVUS 

TCPHICTVVNFKELAEHHRSKYPGCTTTIVVD 

AMCCLRYWYTPESWICGGQWREYFSALRDF 

VKTFTAAGIiaiFFFDGMVEQDKRDEWVKRR 

LKNNREISRFHYIKSHKEQPGRNMFFIPSGLA 

VFTRFALKTLGQETLCSLQEADYEVASYGLQ 

HNCLGILGEDTDYLIYDTCPYFSISELC3jESLD 

TVMLCREKLCESLGLCVADLPLLACLLGNDU 

PEGMFESFRYKCLSSYTSVKENFDKKGNULA 

VSDHISKVLYLYQGEKKLEEELPIWTKQSSFL 

♦RNGnSFTRT/INLHGFSKNPKV**LWTNK*YP 

RVQTPNPGKKFPCVQMLNPGKKFPCVQALNP 

GEKPPCIHI/PEPRQEVPTCSDPEPRQEVFTCTG 

PESRREVPMCSDPEPRQEVPMCTGPEPRQEVP 

MCTGPEARQEVPMCTDSEPRQEVPMCTDSEP 

RQEVPMYTGSEPRQEVPMYTGPESRQEVPMY 

TGPESRQEVLIRTDPESRQEIMCTGHESKQEV 
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hod 
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amino add 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A^Alaninc OCysteinc, 
I>=Aspartic Acid, E^Iutamic Acid, 
F=PhenyIalanine, G=Glycinp, H=Histidinc, 
l=IsDlcucinc K=Lysine, LpLeucine; 
M=Mettiionine, N=Asparaginc, PMProrme, 
Q=Glutamine, R=Argimne, SaSerine, 
T«Threonine, V«Valinc» W«Tryptophan, 
Y^Tyrosinc, X=Unknown, *=Stop codon, 
/^possible Qudeotide deletion, >Fpossib]e 
nucleotide insertion 














LEALMCTNPEIKQEDPTNVGPEVKQQVTMVS 

DTEILKVARTHHVQAESYLVYNIMSSGEIECS 

NTLEDELDQALPSQAFIYRPIRQRVYSLLLED 

CQDVTSTCIAVKEWFVYPGNPLRHPDLVRPL 

QMnPGGTPSLKILWLNQEPEIQVRRLDTlXA 

CFNLSSSREELQAVESPFQALCCLLIYLFVQV 

DTLCLEDLHAFIAQALCLQGKSTSQLVNLQP 

DYINPRAVQLuoLL YKuJ-. I iL»VL»viNo/\*-*vjrr 

WKTSDFMPWNVFDGKLFHQKYLQSEKGYA 

VEVL/CRTK*ISAHQIPQPEGSRLQGLHEOEQT 

HHWPSPLGLTPRREVGKTGLQLPQDGLWV 


1094 


2444 


A 


9021 


97 


834 


"AREACRAKTDFPGRRFRLWPSCCCRVIVGAE 
T*H\MAEPVSPLKHFVLAKKAITAIFDQLLEFV 
TEG SHFVEATYKNPELDRIATbUUL V DMyu i 
KDmnGEVLSRRHMKVAFFGRTSSGKSSVI 
NAMLWDKVLPSGIGHITNCFLSVBGTDGDKA 
YLmIEGSDEKKSVKI VNviL/UiAi*nMUfti»L.rk. 
AGCLVRVFWPKAKCAIXRDDLVLVDGPGTD 
VTTELDSWIDKFCTKSSTRErmSOSDT 


1095 


2445 


A 


9022 


1 


537 


" lvlnsrvedfvppegagrtxpfalrflaacw 

T T TTTkT* Annexe AT ODDDDQU//^VQrV^17/vAOAT2P 

LLHRRARRSS ALCrKrJO Wu v ouucaj/vvi/wc 

p*itssscclsaa/shlsiqspnmagarrrirpq 

LAKEKffiGCfflCTSVTPGEPQVFLGKDKAFTF 

dyvfdidsqqeqiyiqciekliegcfegynatv 
faygqtvqagktytmgtofd 


1096 


2446 


A 


9029 


1 


285 


gqhsetpslkkk\lagysgmcl»sqvlrrlrq 
edclspgggncres* scpytpawtterdpv 


1097 


2447 


A 


9032 


716 


357 


lahcnfrhaofpplsclslpnrweyrrpparp 

nwvf VFT VPTfrFnn/G+DfiLDLLTSRSACLG 
U>KCWDYRREPAASrafQTrFFINSK 


1098 


2448 


A 


9038 


230 


652 


KWVMSCEDINIS6SFYRWKUCYI^ato 
TKiP^nrtPVMT WVPSSHIFWOTTCGRLPHKIKO 

g*aaldhlkvfdriplpydkkkqmavsaile 
wrpkp*rkfaylghwaqkvdwkyqamta 
tmgekrkvyyqkicyqkk 


1099 


2449 


A 


9043 


185 


372 


■ HFYSHQQCMRV/WQGCGDIETLIHCW'E'KII 

hsl/wk/iv*qfuau-ylhlphnsviaflgisp 
rkktcpqnsctsmlinaihndqkwkxini 


UOO 


2450 


A 


9045 


763 


584 


rqslalsprlecsgtisahcrlcplvfimcl 
sltsswdyrrppphpanflyfk*rrgf 


1101 


2451 


A 


9050 


275 


2 


T PFT rkvwoflspsllpvnfogfvfaflllll 
fll/femeslpva/rvecsgtisahoaclposs 
dspasas*vagitdmcrytolilfhas 


1102 


2452 


A 


9053 


449 


1224 


ktsmfwkfdlhssshidtlleredvtlkelm 

deedvlqeckaqnrkliefllkaecledlvsf 

i\*eeppqdmdekirykypn1scelltsdvsqm 

ndrlgedesllmklysfllndsplnpixasff 

skvi^ilisrkpeqivdflkkkhdfvdliikhig 

tsaimdlllrlltcieppqprqdvln/wfkvq 

rnl*hst*nvmdiskyvnlhwglnkshsll* 

lllqcvlqwlneekhqrlveivhpsqeedvs 

SLV 


1103 


2453 


A 


9058 


403 


3 


" GLHVYDFQVYREHILTLNVKKCSVSFWGLKb 
WLYLQMYEnKSPRFPIKMTDITKCW^GaGA 
AGMQim/CW\WCVNVGKFWEMS*YYLLKLSI 
ST/PYDPAIPIXGm-*ETRVYIHPKTCMRMLlA 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 

nucleotide 

location 

to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alamne OCysteinc, 
I>Aspartic Acid, EOlutamic Acid; 
FfePhenylalanlne. G=Glycine. H^Histidine, 
Issisoleudne, K-Lysine;, LpLeucine^ 
M^Methiomne, N=As(^TOgine, Ps^Proline, 
Q=<jlutaiiune, ReArghine, &=Serine, 
T«Threoninc, V-Valine, W-Tiyptophan, 
Y=Tyrosinc, X-Unknovni, ♦=Stop codon, 
/^=possible nucleotide deletion, V^possible 
nucleotide insertion 














APFVLAVNC 


1104 


2454 


A 


9064 


75 


393 


KWLFSSLNITGRGDnOHLKWLDCRVNCSSFPI 
KR>niQTHSmESNKLKAQHSFGYN*LIH*NS\V 
KTDCGCGANSKGVVVVM1CV\KTAQQKQTTS 
YMQIGTTKNSRAT 


1105 


2455 


A 


9065 


366 


778 


DIXILIWl^PELKRRNCISRFYLAYHLHKIYS 

JK?>I 1 -1 A^iNNv^aur I HjOLi v^iuy rrrr^ i rrmuK 

AWPCCPGWSAAWLTIVILAHYRRPOLERSCC 
LSLSSSWDHRRVPPCPANP/YFSMGFTAFPRL 
VLNS^TQGI 


1106 


245^ 


A 


9083 


673 


816 


ESGSLIH*WWENKPAQPLWWEI»QHVQKLPT 
HFPCDPAIPLLGICPED 


1107 


2457 


A 


9086 


580 


18 


KPSSCiSi^lKAi Yit LoTAH Vr ALT 5 vL V K J JvL 1 ' 
AFSQSSVLWAHKQQKTSLSLVIR/ERLQIKTA 

•% mr^vrr titt>t a VTT r\XR/V011/fV^/C/«Q!MX4d 

XnRENFLFDu-AKJLrKJ-lJN V W V^/ouoJNMol^ 
I/HCWWEYNVMIWNSVTFPRKVEHVYrrYA 
PEISVR*IHGGLPTLVHQETHTSVFROAPSVIP 
ETRVCRFTKESINKLLHnTMEHYGDENK 


1108 


2458 

0 


A 


9093 


540 


1 


GGNDUSVIFI ilil'UKJUSll''*U:usjr''iirwijLiRi-r 

SAFPPAERSRGHRRASL*RARWSAAVPRRSA 
GSASEPVQSRWLRLPVGSDSPPAVPVRVCPAP 
DSRPAAPGSRLPDPGLDSPAPSRTPSSSVD^OG 
QRPPPPSGDSLSPPGCCRY 


1109 


2459 


A 


9099 


1255 


1425 


HESYHVNPNLCNPVAPTSGAHSIG'KWPSWL 
GAVAHSCNPSTLVGRGGRTTRGQELR 


1110 


24^ 


A 


9103 


242 


70 


EEQFFFFAVGMFP*VDFLAPASGELWDRLRLT 
CSRPFTRHQSFGLAFLRVCSSLDSLDDSWGP 

LLRKQRNKRMAIP 


nil 


2461 


A 


9110 


189 


121 


SFLSVRLECNGAIMAHCALPLPG 


1112 . 


2462 


A 


9113 


100 


910 


RRRGGGSRPRRTPVPAPGPGPSFGMDVRFYP 
AAAGDPASLDFAQCLGYYGYSKFGNNNNYM 
NMAb AiS N At r AAo til^ 1 r ii i r oLAjL'Ccr jurrx i 
PPPESDPALGMPDVLLPFQALSDPLPSQGSEFT 
PQFPPQSLDLPSmSRNLVEQDGVLHSSGLHM 

VMPPAQLTTINQSQLSAQLGLNLGGASMPHT 
SPSPPASKSATTSPSSSINEEDADEANRAIGEK. 
RAAPDSGKKPKTPKK 


1113 


2463 


A 


9120 


3452 


3051 


FLRPSFALVPQAGVQWCALSWLQPPSFRFJC^F 

SCLSLPSSWDYRHVPPRPANFFVLLVETGFLH 

VGQAGHEPLTSGDPPASASQSAGITGVSHQA 

WPSFFIFSRDTVLLCCSGWSRTSGLKQSACLS 

LLKCWDY 


1114 


2464 


A 


9122 


152 


377 


NQLPLQQWTFFIYETGFCSVAQAGVQCRDHS 
SLHP*PPG\SSDPPAPPS*VLGITGQRYHACLn 
YLYVQTVPQRV 


ins 


2465 


A 


9124 


553 


981 


QRPLLRQQLGSWPTCRSLEGDLASPW»*RLPG 

SPRMRRSGT/ATLNLPLSPQGTVRTAVEFQVM 

TQTQSLSFLLGSSASLDCGFSMAPGLDLISVE 

WRLQHKGRGRGDLHLPDHHLSVPSSADHPA 

QQPSQFNGRNLYFLPLFR 


1116 


2466 


A 


9135 


48 . 


410 


SASHEPAEHDGGADSLSASQPPRPAORPAOA 

QHVHVPPWTDVLAGQDRRAPTAODQAPWP 

APGGHVPSTRPHDPAEFHADEAAGRGGRGLQ 

PAAPHALPAGLPHGPPAPA/PAEGGGTP*GSA 

GAGGP*GSPAGRACGAAGCRPRPPRPAASSA 

♦NSAGS»GLVEGT*PPGAGHGAPSPAVGARLS 
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SEQID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQED 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

corrcspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
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to last amino 
acid residue 
of peptide 
sequence 


Amino aciQ sequence ^APAianmc v^v^jsicui^ 
D=Aspartic Acid, B=Glutamic Acid, 
ps^henylalanine, G=Glycine, H==Histidinc, 
lolsoleucine, K«tysine, D=Leucine, 
M=4V(ethionine, N=Asparaginc, P=Proline, 
Q=Glutaminc, R^Arginine, S==Serine, 
I**! nrconinc, v^voiinc» vf^xryyisipuBSif 
Y=Tyrosine, X^Unknown, ♦=Stop codon, 
/^possible nucleotide deledon, V=possible 
nucleotide insertion 














CPARTSVQGGTWTC*APAGRPAGLGGWEAJi 
RESArrDCoAOo'DAU'uAlir WuAuolvo W uo 


1117 


2467 


A 


9141 


380 


939 


iCSGHWAKECLQPRIPPRPCPICVGPHWKSDCP 

TCPGAVPRAPGTLPQGSLTDSFPDLLSLVAED 

*CCLMASEASWTIT\ELWVTLTVEGKSVP/CL 

NTEATHSTLPSFQGPVSLASITWGIDGQASKP 

LKTPQLWCQLGQYSFMHYFLVIPTCPVPLLG* 

GILTKLSAFLTEPRLQPHLIAALSPSS 


1118 


2468 


A 


9154 


471 


2 


AAGQVWEVTSHLYLCrrSDAAGLRLLPPAES 
ERGEGGHCPAEAPLPPRPQYCLAKHPLLRKLP 
EEKIKLDPYLTQHTKINSKQIKYLSATIAKTTQ 
LVEGN1GVNLQNTELKQH*INGFLDTTPEAQE 
TKJEKTNKLNFIKKVKRQLAEWEKIFQIA 


1119 


2469 


A 


9155 


2 


3187 


ACPRLARRRRRVRSLRRRRGWLRARWSRGQ 

NNMAARRITQETFDAVLQEKAKRYHMDASG 

EAVSETLQFKAQDLLRAVPRSRAEMYDDVHS 

DGRYSLSGSVAHSRDAGRESLRSDVFSGPSFR 

SSNPSISDDSYFRKECGRDLEFSHSNSRDQVIG 

HRKLOHFRSQDWKFALRGSWEQDFGHPVSQ 

ESSWSQEYSFGPSAVLGDFGSSRLDEKECLEK 

ESRDYDVDHPGEADSV/LRGGSQVQARGRAL 

NIVDQEGSLLGKGETQGLLTAKGGVGKLVTL 

RNVSTKKIPTVNRITPKTQGTNQIQKNTPSPD 

VTLGTNPGTEDIQFPIQKIPLGLDLKNLRLPRR 

KMSFDUDKSDVFSRFGIEIIKWAGFHTIKDDIK 

FSQLFQTLFELETETCAKMLASFKCSLKPEHR 

DFCFFTIKFLKHSALKTPRVDNEFLNMLLDKG 

AVKTKNCFFEinCPFDKYD^QDRLLKSVTP 

LLMACNAYELSVKMKTLSNPLDLALAIJETTO 

SLCRKSLALLGQTFSLASSFRQEKIL*AVGLQ 

DIAPSPAAFPNFEDSTLFGREYIDHLKAWLVS 

SGCPLQVKJKAEPEPMREEBKMIPPrKPEIQAK 

APSSLSDAVPQRADHRWGTIDQLVKRVIEGS 

LSPKERTLLKEDPAYWFLSDENSLEYKYYKL 

KLAEMQRMSENLRGADQKPTSADCAVRAML 

YSRAVRNUaCKIJLP\WQRRGLLRAQGVLRG\ 

WKARRAVTTGTQTLLFLRAPOLKHHGRQAPO 

LSQAKPSLPDRNDAAKDCPPDPVGPSPQDPSL 

EASGPSPKPAGVDISEAPQTSSPCPSADIDMKT 

METAEKLARFVAQVGFElcvir olbNd l UWr JJi- 

WFLHDQNSSAFKFYKKKVFELCPSICFTSSPH 

NLHTGGGDTTGSQESPVDLMEGEAEFEDEPF 

GAGKSEGSTPADQLPGEAAEDDLAGAPALSQ 
ASSGTCFPRKRISSKSLKVGMIPAPKRVCXIQE 
PKGECPPVGTVASSTVLGWWAVRVRRDRWR 
HFNPKH?CAPWNVSRHSCFPVV 


1 nn 


zrrtXJ 


A 
r\ 


9163 


124 


207 


PPRACRPCPRACPCPPPKCSQPVSWPC 


1121 


2471 


A 


9166 


272 


523 


PMSSLQGCFYTFKCOFKGIFLLUSNUAF^'EK 

V/CSHTTDSLKFIGKGWVGMVTHACNPGTLG 

G*GGWIA*VRBFETSLGNM 


1122 


2472 


C 


9170 


442 


236 


MNRRRFLRPADCHSGMRGTENGACSEGESQI 
HCGAGGEGVQLVHWNQPENGCLQFDSTHrr 
FSKRQN* 


1123 


2473 


A 


9171 


10 


423 


MVDRSPLLTSVUFYLAIGAAIFEVLEEPHWKJi 
AKKNYYTQKLHLLKEFPCLGQEGLDKILEVV 
SDAAGQGVATTGNQTFNNWNWPNAMIFAAT 
VnriGYGNVASKTPGGRLFCGFYGLFGVPFC 
LTWINALOKFFO 
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SEQID 
NO: of 
nucl- 
eotide 
scq. 
uence 


SEQID 
NO: of 
peptide 
seq. 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspond! 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A»Alaninc OCystcinc» 
D=Aspartic Acid, E^lutamic Acid, 
PsPhpnvlalanine. G=Glyclne. H=Histidine, 
l=Isolcucine, K^Lysine, L=ieucin^ 
M=Methionine,N=Asparagine, P=Pn)line. 
Q=Glutamine, R=Arginine, S=Serine, 
T«Tlireonine, V«Valine, W=*Tryptophan, 
Y^Tyrosinc, X^Unknown, ♦^Stop codon, 
/=possible nucleotide deletion, \FpossibIe 
nucleotide insertion 


1124 


2474 


A 


9173 


3 


374 


"GPSPSLLVLLPQEPGGTGTPVRAGAGAGMWL 
WEDQGGLLGPFSFLMLMLLLETRNPVNACLL 
TGSLFVLLOVFSFEPVPSCRALQELKPRDRISA 
T A HTJ nnu HnpPFNTT .G AIR/OGS* ♦ WSNRR 


1125 


2475 


A 


9179 


704 


188 

233 


ESSSGLLFQCFQGIHVQKLTLQARPTLFSWWL 
CSKPPKETGELENAESGGDGGRROGKQDNV 
AWWRRM^QKG\DFPWDDEDFPQSGPFGQQA 
LPMGFFYLYFRDPGREITWKHFVQYYLARGL 
VDRLEWNKQSVRVIPAPGTSSEVRGEFKAE 
YCRHKHSCKNWFYFFO 
■ MEYMAESTDRSPGHILCCECGVPISPN 


1126 ■ 
1127 


2476 
2477 


A 
A 


9183 
9185 


153 
1 


321 


LTGQLGSIIXRVFSKbRAULOAkXLKA x K 1 M 
EYMAESTDRSPGHILCCECGVPISPNPAQY\CV 
ACLRSSFHIYHCIPKLFJHPFSKTSSSAFITPSHY 
LTFFSnS 


1128 


2478 


A 


9186 


183 


847 


VLKFLLLQTMDEQSQGMQGPPVPQFQPQKAL 

RPDMGYNTLANFRIEKKIGRGQ\FSEVYRAAC 

L\LDGWVALKKVQIFDLMDAKARADCIKEIP 

LLKQLNHPNVIKYYASFIEDNELNIVLELAJDA 

GDLSRMIKHFKKQKRLIPERTV WK Y r villus 

ALEHMHSRRVMHRDIKPANVFITATGVVKLG 

DLGLGRFFSSKTTAAHSLVGTPYYMSPERIHD 

NG 


1129 


2479 


A 


9190 


1 


370 


GTSWKIPSAAVSESSPNGAAYASGLPCGVRU 
PPWAGLALLPSPTLMALLRKr 1 VdoUJLUINUJ i 
RATTymVVATITRARIEDMRHSATALTRPD 
ATTAQPKLPVTTVCNRRANPGIPPSVL 


1130 


2480 


A 


9194 


131 


487 


AYLKRLPVPESITGFARLTVSEWLRLLPFLGV 
LALLGYLAVRPFLPKKKQQKDSLINLKIQKEN 
PKWNEINIEDLCLTKAAYCRCWRSKTFPAC 
DGSHNKHNELTGDNVGPLILKKKE 


1131 


2481 


A 


9201 


184 


605 


KELVDEKSERGRAMDPVSQLASAGTFRVLKE 
PLAFLRALELLFAIFAFATCGGYSGGLRLSVD 
CVNKTESNLSIDIAFAYPFRLHQVTFEG\PTCE 
GKERHKLAUGDSSSSAKTrUl VAurAT-ui oi-» 
AATGVYIFFQNKY 


1132 


2482 


A 


9206 


1 


852 


l3GGRAGAGSRDMGSTDSKLNFRKAVIQLTrK 
TQPVEATDDAFWDQFWADTATSVQDVFALV 

■nA A'crn A\n>rCCDC>JT ATT PVTrAVPm.VOCrA 

PAAEIRAVKbbor briL«A i lA^ J v rtisAj t vvj^^ 

ESGCHSEKEKQIVLNCSRLLTRVLPYIFEDPD 

WRGFFWSTVPGAGRGGQGEEDDEHARPLAE 

SLLLAIADLLFCPDFTVQSHRRSTVDSAEDVH 

SLDSCEYIWEAGVGFAHSPQPNYIHDMNRME 

LLKLLLTCFSEAMYLPPAPESWQHAITHWFSS 

FVSSENRHALPLFTSLLNTVCAYDPVEyOIPY 

NHLY 


1133 


2483 


A 


9208 


1165 


1463 


GPRARVQGFSGADIVKFMALGSMYLVLTUV 
AKVLRGAEPCCGPLKNRVLRPCPLPWLPPP 
HPQPSRGNPVGCLPTYKWYKILSWPLHSNS 

NVYFIV 


1134 


2484 


A 


9210 


66 


1586 


■ MAGAGPIOIRAI^APVAEEKEEAREKIMAAR 
RADGAAPAGEGEGVTLQGNITLLKGVAVIW 
AIMGSGIFVTFrGVLKEAGSPGLALWWAAC 
GVFSIVGALCYAELGTnSKSGGDYAYMLDV 
YGSLPAFLKLWIEailRPSSQYIVALVFATYL 
LKPLFPTCPVPEEAAKLVACLCVIXLTAVNC 
YSVKAATRVQDAFAAAKLLALALIILLGFVQI 
GKGDVSNLDPNFSFEGTKLDVGNIVLALYSG 
LFAYGGWNYLNFVTEEMINinrRNLPLAniSLP 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uoice 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A«=Alaninc C>=Cy^eme, 
D=Aspartic Add, EpGlutamic Acid, 
F^Phenylalanine, GMjlycine, H=Histidine, 
Msoleucine;, K=Lysine, Ls^Leucine, 
M=Methionine, N=Asparagine, P=Ptoline, 
i^=uiutanune, K^^Argujine, a^^ocnijc, 
T^TTireonine, V=VaIine, W=Tryptophan. 
Y=Tyrosine, X=Unknown, ♦=StDp codon, 
/spc^'ble nucleotide deletion, \=possible 
nucleotide insertion 














IVTLVYVLTNLAYKrrLS'lBQMLSSEAVAVDF 

GNYHLOVMSWnPVFVGLSCFGSVNGSLFTSS 

RLFFVGSREGHLPSE.SMIHPQLLTPVPSLVFT 

CVMTUTAFSKDIFSVrNFFSFFNWLCVALAn 

GNflWLIUTOKPELERPIKVNLALPVFFILACLF 

LIAVSFWKTTPWSVASDFTIILSGLPVYPFGV 

WWKNKPKWAPPGHLSPRPSCVRSSCMWPQ 


1135 


2485 


A 


9216 


40 


410 


RDRLPPAYPCRPWCWTALDVGNSPESQEM 
DLVAFEDVAVNFTQEEWSLLDPSQKNLYREV 
MQETUlNlj^SIGEKWKDQNIEDQYKNPRNNL 
RSIXGERVDENTEENHCGETSSQDPDDTLNK 


1136 


2486 


A 


9223 


3 


983 


RRKRRSRYRRCSRFPRPGPLAVSMPHAFKPG 

DLVFAKMKGYPHWPARJDDIADGAYKPPPN 

KYPIFFFGTHETAFLGPKDLFPYDKCKDKYGK 

PNKRKGFNEGLWEIQNNPHASYSAPPPVSSSD 

SEAPEANPADGSDADEDDEG\RGVMAVTAVT 

ATAASDRMESDSDSDKSSDNSGLKRKTPALK 

MSVSKRARKASSDLDQASVSPSEEENSESSSE 

SEKTSDQDFTPEKKAAVRAPRRGFLGGRKKK 

APSASDSDSKADSDGAKPEPVAMARSASSSSS 

SSSSSDSDVSVKKPPRGRKPAEKPLPKPRGRK 

PKPERPPS SS S SD 


1137 


2487 


A 


9229 


21 


239 


LFPRLECRDPVTVNCTLNLPGSKNAPTTASQV 
GSTWNYRGGLPHPTNFFVKTGFRCSQAGLKL 
RGSREPPAWA 


1138 


2488 


A 


9231 


1664 


2 


TRSVGVNTCEVGWTEPECLGPCEPGTSVNL 

EGIVWHETEEGVLWNVTWRNKTYVGTLLD 

CTKHDWAPPRFCESPTSDLEMRGGRGRGKR 

ARSAAAAPGSEASFTESRGLQNKNRGGANGK 

GRROSLNASGRRTPPNCAAEDDCASPSSTNKR 

KNKPPMELDLNSSSEDNKPGKRVRTNSRSTP. 

TTPQGKPETTFLDQGCSSPVLIDCPHPNCNKK 

YKHINGLRYHQAHAHLDPENKLEFEPDSEDK 

ISDCEEGLSNVALECSEPSTSVSAYDQLKAPA 

SPOAGNPPGTPKGKRELMSNGPGSnOAKAGK 

NSGKXKGLNNELNNLPVISNMTAALDSCSAA 

DGSLAAEMPKLEAEGLEDKKNLGDKEKGKK 

ANNCKTDKNVPSKLKSARPIAPAPAPTPPQLIA 

IPTATFrrmGTIPGLPSLrrTVVQATTKSFPL 

mQPKPTIMGEPnVNPALVSLKDKKKKEKR 

KLKDKEGKETGSPKMDAKLGKLEDSKOASK 

OX-rl'Vjilr IJvL/rlbri MNIiVJi-f/\lN \ jJjO Jio\^ijtji\iTiiT*J 

IKAEADKVYTFTDNAPSPSIGS 


1139 


2489 


A 


9234 


207 


443 
328 


trrgqpwrrraaaagdlpgreaaaclpsc/as 
vtaavsgllvgyelgnsgallqiktllalsc 

rlbvjcMU V ooi^ V l\j/\LtL, 
MAQGNNYGQTSNGVADESPNMLVYRKV 


1140 
1141 


2490 
2491 


A 

A 


9238 


248 


535 


fveaavkmlgslvlrrkalaprlllrllrsf 
tlrghggasgrnvttgslgepqwlrvatgg 

RPGTSPALFSGRGAATGGRQGGRFDTKCLAA 
ATWGRLPGPEETLFGQDSWNGVPSRAGLGM\ 
WPWAAALWHCYSKSPSNKDAALLEAARAQ 
\NMQEVSRNRCALLHSAAVQEYGYGN 


1142 


2492 


A 


9245 


157 


466 


■ HLCFWFFVGLFLPEQQINtLFATLLRMAQGUD 
FAIXM^FLNTITKAQAmCEKLDKLDFIKIKTC 
CTSMDAIEKTEPLTKWTKAFVSHVSYKRLLF 
GICKEYSRQ 


1143 


2493 


A 


9247 


264 


115 


GLPQQTSnQPPGTPDGARDFTSTIQPPGAPDG 
ARDSTSnRMGPEIPPP 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alaninc OCystcine, 
I>=Aspaitic Acid, E^OIutamic Add, 
F=Phenylalanine» G=Glycuie, H«=Histldin^ 

fslcnfriicinft K<sLvsine. LpLcucinc. 
M=Methionine, N=Asparagine, Ps^>roline, 
Q=Glutamine, R=Arginine, S^erine, 

TesThrenniTie VsValinc WtaTrvnffmhnn. 

Y«Tyrosine, X=Unkiiown, ♦«Stop codon, 
A=possibIe nucleolide deletion, \Fposstble 

nUbJCUUUC ujavjuuu 


1144 


2494 


A 


9260 


1 


401 


KKVPGRI^EMSFSLNFl'U'ANTTSSPVTVDCXJP 
SLGLAAGIPLLVATALLVALLFTLIHRRRSSIE 
AMBESDRPCEISEIDDNPKISENPRRSPTHEKN 
TMGAQEAHIYVKTVAGSEBPVHDKYRPTlENr 
ERRR 


"mT 


2495 


A 


9264 


175 


411 


METIVVT^^QFRLMGDSTVGKSCLLHRFTQGRF 
PGLRSPACDPTVGVDFFSRLLEIEPGKiaKLLL 
WDTAGQERHSIT 


1146 


2496 


A 


9277 


592 


814 


MFTYLEGREGKSQPKMEPHSVnRLECSGMI 

SAHCSLNLPGTSDSPASASR/VAGTTGMRHHA 

WLIFAFLVETGF 


1147 


2497 


A 


9279 


1255 


2 


iFRRGRRGEEEKEEEEEEEEGWVNGMENSHPP 

HHHHQQPPPQPGPSGERRNHHWRSYKLMIDP 

ALKKGHHKLYRYDGQHFSLAMSSNRPVEIVE 

DPRWGIWTKNKEU-ELSVPKFKIDEFYVDQV 

PPKQVTFAKLNDNIRENFLRDMCKKYGEVEE 

VEILYNPKTKKHLGIAKWFATVRGAKDAVQ 

HLHSTSVMGNIIHVELDTKGEmiRFYEL\LV 

TORYTPQTLPVGELDAVSPIVNETLQLSDALK 

RLKDGGLSAGCGSGSSSVTPNSGGTFFSQDTA 

YSSCRLDTPNSYG/QGTPLTPRLGTPFSQDSSY 

SSRQPTPSYLFSQDPAVTFKARRHESBCFTDAY 

NRRHEHHYVHNSPAVTAVAGATAAFRGSSD 

LPFGTVGGTGGSSGPPFKAQPQDSATFAHTPP 

PAQATPAPGFR 


1148 


2498 


A 


9302 


1026 


6 


lASIQNADTMPGVGLLVSHFSTLVSRQRCPNY 

ADPQNLTDVSIFLLLEVSGDPELQPVLAGLFL 

SMCLVTVLGNLLIILAISPDSHLHTPMYFFFSN 

LSLPDV\GFTSTTVPK\MIVDI\QSRSRVISYAG 

CLTQKSLFAIFGGTEEVNMLLSVMAYDRFVAI 

CHPLYHSADvlNPCFCAFLVLLSFFFLSLLDSQL 

HSWIVLQFTIIKNVEISNFVCDPSQLLKFACSD 

SnNSIFIYFHKDPERQLVIJluLl'i-aMi^ I vi- 

GNLinLDVSPDSHLPTPMYFFLSNLSLPDIGFT 

oTTnronrKjff^mTnotlrflBVTlJVArtPl TOMQI FATF 
S J 1 VriUVllVUil^DtlUiKVlr i AOUblv^iwCwurmr 

GGMEERHAPECDGL 


1149 


2499 


A 


9303 


1 


699 


MASQEKDMGWGTIHLFRKPQRSFFGKLLRE 

SSTNSIALTNSYTVLTffDLFSLMTCUSYWVTL 
RKPSPVYSFGFERLEVLAVFASTVLAQU3ALF 
ILKESAERFLEQPEIHTGRLLVGTFVALCFNLF 
TMLSIRNKPFAYVSEAASTSWLQEHVADLSR 
<5T rnrrpoT ssiFLPRMNPFVLIDLAOAFALCIT 
YMLIEI 


IISO 


2500 


A 


9308 


797 


693 


DRSTSVTRAOVQWCSLGSLQPRTPGLLRSStX 
SLP 


1151 


2501 


A 


9309 


205 


406 


VADCELPVLWKWSKPTRNTAKEPPQTQQRAU 
SKTAAPPCQWSRMASEGPNIPCPGARHSDKQ 

Fucn 


11S2 


2502 


A 


9314 


913 


504 


KPSPLITPPAVVLPPSAVLNLVNTFSSFPQVtV 
QGPLCGPRKGRLAVTIPFFGLS/LPKYMDHRR 
PPPHR\EIFFVFLAETGFHRASQAGPDLPTS/S/I 
PPTSA/FPKCWBYRSEPQCLPGCLSFSGILLDL 
GTNVSLRAA 


1153 


2503 


A 


9315 


392 


1 


"HPHRPRPGFRSPARSSRPCPVLTSLLPPFPSPJil' 
PADDLVKAGRDRKDPQVR/ERRLRPNPGRLG 
GPRNPRPARARS/CHQPRLTRVCPRSPPPEARA 
PAPAAPARGRGAPKRNRPRTDTRAPRGSSAR 
PONS 
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SEQ ID 
"NO: of 
nucl- 
eotide 
seq- 
uence 


SEQID 
ri\J. 01 
peptide 
seq- 


Mei 


in 

USSN 
09/496 
914 


nucleotide 
location 
correspondi 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Piwlicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Ammo acid sequence (A«Alaninc OCystcinc, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phcnylalanine, G=Glycine, H^Histidine, 
Jslsoleucine, K=Lysine, L?=Leucine, 
M^Methionine, N-Asparagine, P^Protine, 
Q=^lutamine, R»Ai:ginine, &=^Serine, 
T"Threonine, V-Valine, W»Tiyptophan, 
Y«Tyrosinc X^Unknown, ♦=Stop codon, 
/=possible nucleotide deletion, V=possible 
nucleotide insertion 


1154 


2504 


A 


9321 


331 


433 


MPCI/QAQYGTPAPSPGPRDHSASDPLTFEFIK 
PT 


1155 


2505 


A 


9324 


180 


275 


MEEPQSDPSVEPPLSQETFSDLWKLLSENNYL 


1156 


2506 


A 


9326 


383 


619 


MISPSRTEGDPLPLPP/EGEC5QEVRGFGGGPAK 

EAAQRHCRASVSILRMRRPOQGSSRPARVPL 

RGPDSHRLREPPPSPP 


1157 


2507 


A 


9327 


152 


292 


YERRGRSQGGGSHPAGAQPGGRAIQAGWQS 
KEPLWEGLQRSGSPLPG 


1158 


2508 


A 


9328 


1 


430 


QELKQGPNPLAPSPSAPSTSAGLGDCNHRVD 
1 QKTF^V^^AT AMLOFRRCLYWLTDSRCFL 
VCMCFLTTIQALMVSGYLSSVrmERRYSLKS 
SESGLLVSCFDIQNLWWFVSYFRGRRRRP/ 
RVAAVGGLLDLEGGEMI 


1159 


2509 


A 


9334 


108 


383 


KGNQVNGNGNQLKRKHESMCPVSLTQNTVR 
T KiTCAi^i PAi^nAT^p Anm pragi.vtyvklder 
VLNALXYSSVGLQWFKESDLSHLRLLEISFR 


1160 


2510 


A 


9338 


2 


430 


■ FVGRPRGLSDRLEDLFLAOTRVGERLRTAAM 
KRYVRILLLGEGAEHVADPVPGGRGVPRGEA 
DHTDQELREEIHICANVERWHDVSQEA'nEKI 
RTfCWIPLV/RWGDHA/EGPVGIKSYLPSGRSM 
EAELPIMSQLTEIETCVEC 


1161 


2511 


A 


9341 


1 


390 


NSRVDDFVAPGLSEAGKLLaUifJl'iiKyKi.AA 
AVG/CSPMSGVISMSAPFn.GKffl)AIYTNPTV 

nvcriMi TPl n fH ^rtVTTT fYrAAAWAIRVYLM 

QTSRQRWKRLRTSLFSSILGQEVAFSDKAGT 
GELI 


1162 


2512 


A 


9343 


84 


837 


QGRFRAFCWQRDFLQFPGMRLSALLALASKV 

TT Di>TrvTJVrt\>rQPPn^VAnKRlCNPPWIRRRPV 
1 jL#rx^rl I ivi vjiVioi^r\ji>v/iJux^J^ wuxivna ▼ 

WEPISDEDWYLFCGDTVEILEGKDAGKQGK 

WQVIRQRNWVWGGLNTHYRYIGKTMDYR 

GTMIPSEAPLLHRQVKLVDPMDRKPTEIEWR 

FTEAGERVRVSTRSGRIIPKPEFPRADGIVPET 

WIDGPKDTSVEDALERTYVPCLKTLQEEVME 

AMGDOETRWRRSIGIEPOAEQLLPNFCPSLE 

G 


1163 


2513 


A 


9346 


967 


616 


DSLALSPRLECSGAISAHCNL'lPPGK!PJt''SCi-S 
LPSSWAYRCASPHPDNFFVFLVESGFHHVGQ 
AGLKLUSSDPPTSA/FPKCWDYRRDNSSAPAT 
FSSYQRNNPDULNDTIMPNIK 


1164 


2514 


A 


9347 


3 




<?SFPTCMRTVFHSNTSVSSLLHRPGHV'rPQL'ri 

HGGWRHHR0HTAIDEWDFNPSKFIJYTCLLL 

FSVLLPLRLDGIIQWSYWAVFAPIWLWKLLV 

VAGASVGAGVWARNPRYRTEGEACVEFKA 

MLIAVGIHIXUJvlFEVLVCDRVERGraFWLL 

VFMPLFFVSPVSVAACVWQFRHDRSLELEDLC 

SVNILQFIFLUJaJ)RnHWPWLVVFVPLWIU^ 

SFLCLVVLYYIVWSLLFURSLDWAEQRRTH 

VTT^SWnWVPLLTFEVLLVHRLDGHNTFS 

YVSIFVPLWLSIXTLMATTFRRKGGNHWWF 

AIRRDF/CQDQLPQPTGKPPPPPLTDHHGEKA 

LPLONKDRGSWPASRGSPRLL 


1165 


2515 


A 


9362 


547 


991 


DVSIGPPLLRRPCSGREQraSLSFPSDPESSFSF 

VPEGVRLADGPGHCKGRVEVKHQNQWYTV 

CCTTGWSLRAAKWCRQLRCGRAVLTNQKRC 

TKHAYGRKPIWLSQMACSGPEFILHDCPFRP 

LGEDTLFHYEYTSVHGRERLSAKD 


1166 
TlS? 


2516 

Tiii 


A 

■A" 


9363 
9368 


201 
707 


387 
1087 


PPILRWTPPSGKNFFF'FFFF'ESEFY/SSPRVECS 
GAISAHLAHCNLCLPGSSDSPASAFQVAS _ 
AVLTPCLSPCSPSRIPRP\SRPYPORRSLSHTPP 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq. 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ngtofiist 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine U=t:ysteine, 

D^ASpaniC ACIQ, i;=vJluuuim» n.wu,^ 

F^Phcnylalanine, G=Glycine, H=Histidine, 
l=Isoieucine, K=Lysine, D=4-eucine, 
M=Methiomne, N^Asparagine, P=^Proline, 
Q=Glutamine, R'-Arginine, S«Serinc, 
T=Threoninc, V='VaIlnc. W»Tiyptophan, 

V— T<fMei«t» Y — 1 1n^nnu/n ^=:fitnn COdOtL 

I— lyrosinc, a — uiixjiuwu, %jixiy \«uuut<« 
A=possible nucleotide deletion, \=po5sible 
nucleotide insertion 














PRPLILYAPAPVRPAGTAFIPHSHPPPPDLLRPT 
PLPFPPPSS/RPPRPDCSTSYSPTFPPPT 


U68 


2518 


A 


9375 


511 


15 


MMLSEETSAVRPQKQTRFNCiAKLVWMLKGS 

PITVTSAVnVLMLLMM/IFSPWLATHDPNAID 

LTARLLPPSAAHWFGTDEVGRDLFSRVLVGS 

QQSILAGLVWATTGMIGSPLECLFGELGGRA 

DAIFMRVlvroiMRS/IPSLVLTMEKTAALGPSL 

FNAMOASSEH 


"1169 


2519 


A 


9377 


42 


410 


GNGRVAPRDPGAVASAEPGLTTHDSGVNPN 
NSARRMEAMASGSNWLSGVNVVLVMAYWS 
LVFVLLFIFAKRQIMRFAMKSLRGPHGPVGH 
NAPKDLKEEIDILL5RVHNIKYEP\HLLADDDA 


1170 


2320 


A 


9378 


302 


1303 


GVSGFSASVLRQRRMEDELEPSLRPRTQIQCjK. 

ILLLTICAAGIGGTFQFGYNLSIINAPTLHIQEF 

TNETWQARTGEPLPDHLVLLMWSLIVSLYPL 

GGLFGALLAGPLAITLGRKKSIXWNNIFWS 

AAILFOFSRKAGSFEMIMLGRLASWGVNAGV 

SMN1QP\MLPGGESAPKELRGAVAMSSAIFTA 

LGIVMQQWGLSTTAATGLRGL\AGELEELEE 

ERAACQGCRARRPWELFQHRALRRQVTSLV 

VLGSAMELCGNDSVYAYASSVFRKAGVPEA 

KIQYAIIGTGSCELLTAWSVSLEGALPPPAL 

WGGTPRSFALNOraQKKKK 


1171 


2521 


A 


9381 


2 


412 


RGPASAQEDERARTAPLERVRARGRMTl^iSA 
LFPSLLPCSWSTSNKYLAEFRAGKMSLKOl lb 
TPDKRKGLAY/IQQTDDSLIHFCWKDRTSGNV 
EDDLIIFPDDCEFKRLPQCPNORVYVLKFKAG 
SKRLFFWMQEP 


1172 


2522 


A 


9384 


20 


355 


GWNGRSTEASPAAEAPHWHKETVKAAMij ly 
CmGGKWDPHDMLTTVVHKIKlJVLCHSL 
LQLCAIMISDYLKSSirrVEKRLGLFRPTSGLL 
ASFNEVGNTALIVLESY 


1173 


2523 


A 


9393 


430 


87 


LCQCIVPGQQKJjTtSLNf bSAi vju- iL.'L,oi-vi 
QRKEDQ*IIL*YHLNKDCLHIFMSAITLYMKI* 
KIFVLFDFNIMFETPFY1I*FIFLFSQNLKRIRQV 

IRPPISFSKINNGP 


1174 


2524 


A 


9397 


77 


374 


ERLEIGRLGGERGSGP AbCLK V v oum w i/v 
RLVKLALLQLLRAFYGIKVKGVRVHRDCX}TF 
ESSSTLIRVS*FGVPCNALAHFGVTHF*Y1LDF 

LGML 


1175 


2525 


A 


9399 


66 


397 


HESSRADRDKMJU 1 Kuo 1 1 i iJi\ur v i^iwwv* * 
AKMNKWSKGKVRDKLNNLVLFDTATYDia 
CKEVPNYKLITLAWSERLKIPGSLARAALHE 

LLoKvjL»l 1^ V H^ril/\V v n 


1176 


2526 


A 


9408 


2 


299 


■ LDLTHVLSLSISLTVTILGTrFGMVIPLLDVVY 
GERGYAQNGDF'DAQLDDYSFSCYSHAQVN 
GAPNSLTRAVDDP*VKISGLECQKVGALVEV 

KCLNL 


1177 


2527 


A 


9416 


2 


402 


" "CNFLRSSRIRVHSTPAASTMPPKVDPNEIKVV 
YLRCTGGEVRATSALAPKIGPLGLSSIKVGVD 
FV^ATGDWNVLIISVILTIRILLSHIFVVPPFFCF 
DHLIAFWDLOSLIFLHVIFSLFITIXLFCFFSIF 


1178 


2528 


A 


9419 


142 


426 


" "TPLFDLWPRWLSWLETVLTSLRTORAAbUi'f 
ACRIMPTTVDDVLEHGOEVHFLQKQMLYLL 
ALI*DTFAPIYVGIVFLGFTPDHRCRSPGVAEL 


1179 


2529 


A 


9420 


1450 


"1655 


■ ■ LSSAGTKMNLN*KNYWPGASAHACNPSTLG 
GQSRCrrRSGDRDHPG*HGETPSVLKIQKISRA 

WWRAP 
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SEQID 
NO: of 
nucl- 
eotide 
seq- 
uence 



1180 



1181 



SEQID 
NO: of 
peptide 
seq- 
uence 



2530 



2531 



Met 
hod 



SEQ 
ID NO: 
m 

USSN 
09/496 
914 



9422 



Predicted 

beginning 

nucleotide 

location 

coirespondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 



176 



9436 



Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 



Amino add sequence (A=Alanine 0=Cysteine, 
D=Aspartic Acid, E=Glu1amic Acid, 
F=Phenylalanine, G=Glycine, H=Histidinc, 
I=Isoleucine, K^Lysine, L=Leucine, 
M=Methioninc, N^Asparaginc, P^Prolinc, 
Q=Glutamine, R^Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X=UnknowD, '^Stop codon, 
/^possible nucleotide deletion, >r^ossible 
nucleotide insertion 



375 



274 
240" 



HRPQTTRPDWKPRT*PQGK*GRLSSEISPASPP 
SRFSRSTKPVPPKADPPARQKLTGNOJIAPLLK 

L 



PIAASLRMYNlXJPY'l'EBNLlCTAFATMVETYP 
IARTILDRLTGIPHGYCFVE*ADWATADKCVH 
lYNGKPLPGATPLLSLQLHQLAHLGS 



1182 



2532 



9442 



1183 



2533 



1184 



2534 



1185 



2535 



VDKCSSKSIVLSEYCPHCMCSLSTDPKPFGQL 
SMILK*MGAGDEKISAMGKAR\a)HRELYLGL 
LYPTEDYKLTFRARH 



9444 



384 



9462 



391 



9467 



215 



LKDFQPWALHDWPLFCCCTFLLFLVLECFTR 
KGCSGWAPWLSLQCQHFGRPRWADHLRSGV 
RDQPGQYSKTTFLPKIQKLAGHSGAHL*S*LL 
ERMRWKNRJLNPGGRSCSEPRWHHCTPGWAT 
ERG 



655 



5sr 



LSGFKSLMPKIPLQYIYVRVRTTWSFCLPLDG 
RKLMLS*YSK*LT*KYNILPEYSRMTLPPGMV 
I HTCNPSTLGGRAGWIV*AQEFET 
RCPMWQGQASRisdDPAKAJKDREASlx;cSLA 
WWWGWECWVRALKLSSGPAGPLACWVAK 
KKSLSLSGPVYPSEKGAGLYVF*DRVSLCHPG 
WSAWQFWLTAASNSCFSLLSS\yDYRCA 



1186 



2536 



9468 



275 



452 



2537 



9469 



388 



1188 



1189 



2538 



9471 



124 



397 



2539 



9480 



584 



769 



HIPQLHTKTHYVPTRMVNKI^QIDNSKFWQR 
G G*TGILTHCW*ESKLVQPLWKIVWHYQ 
EVAPGPSQILPRRVTDGGDRPQFSLPGPRLPg 
SSRGAEPCLSNCIHSPAPRKQRMGDSDQ*STP 
NPASPHPEAPQEPWDSASGSVOSFSLGRGAK 
ASS*WOKGRGPRQGSELLAETILELFLALAN 

S 

TMDKKNRHGNSLDMASEIHMTGPMCUEN'll 

GRLMANPEALKILSAITQPMVEEAIAGLYRAC 

♦FYLTNNLAGMKKGLCLOSTEOAHTIGI 

GHVQSQHFGRPRRADHLRSGDRDHPG'HDET 

PSLLKIQKISWAWWRAPWPATWEAEAEEW 



1190 



2540 



9483 



463 



IT 



VTVGLTLLLRGAPRFTAG*PPSGGGPPLAFLL 
PRQHCTLQTHRHLHPEAPVKV*ICrRLFPGLR 
GASSCRRRRCNPVLAARKAGSPRSHSTRENC 
RRSRCPPTAHRRRRRGRRRNPSCVRSPRWR 



1191 



2541 



9489 



411 



"LADALCLSAAATGAVRPGARAQPSTOKKLSiF 
SVRVCCRAAAASNLLYSSCLQRHSERASEEG 
ERGSLSAKCCSLVLRGGCSSSNSHSFRRIT*EI 
MAAFVLLSYEQRPLKRPRLGPPDVYPPDPKQ 
KEEELTAVNVK 



1192 



254r 



9497 



389 



161 



VSFLSMSSGHCIRSTRGSKMVSWSVIAJUgiil' 
CEEDERKMAREFLAEFMSTYVMMNIHMIVE 

KDTYSDHEEINTS 

IAKSQ*KRWQRSGAMErLKHGWWECiaVQh 



1193 



2543 



9509 



186 



1 

43r 



FGKTFA^NVN*S*TYVYPCDKnLLLGLYPTEM 



1194 



2544 



9512 



58 



PLQRSKCLTLRCLRAKPWAWSQSPRACSSAL 
LKSSRSRASSLNVQCILQSNPQGHQRI*KQKA 
SSKGQQFRR*KEHPFMLKTLNKLRIEGrLKI 
RRAIYDNFTANDVEOQKLEAFFLRTGTRQ 



1195 



2545 



9515 



595 



1223 



GHGAFSFQTQVPRTP*ASWPVVPAASE5AFAP 

AGGGASLPVAAGSCAAAPHTEPGAPQHLLDC 

PCPLCLARPPRRPLPDTCYGPGSGRSASLAEPP 

LPRCSCAPLRSASAPQVS*CV»AVNLLPHNL* 

PLHaLHD*EKAWOFLFSSASHCFQGQICLLP 

APGSGPCGATARFSRGGRAGGSRARRPIPPGP 

GTRRTPSGCQNPAASGG . 
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SEQID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
914 


Predicted 

beginning 

nucleotide 

location 

corrcapunui 

ng to first 

amino acid 

resiouc oi 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 

fe\ liict nminn 

i\l Itwl ill III 11 W 

acid residue 
of peptide 


Amino acid sequence (A=Alanine C=Cysteine, 
n=AsDflrtic Acid. E^=Glut&niic Acid. 
F«Plienylalanine> OOlycine, H»Histidine^ 
l=Isoleucinc Ks=Lysine, LpLttidne, 
M=Metfaiomne, N=Aspara^e, ^Pioline^ 
Q=Ghitamine» R"ArginiQ(^ &»Serin^ 
T=Threonmc V=Valinc W»Tiyptophan, 
Y=Tyit>siBe» X^Unknown, *«Stop oodon, 
/=pos^ble nucleotide deletion, V^ossible 
nucleotide Insertion 


1196 


2546 


A 


9518 


229 


468 


RSPTATPAPHAMGPGAFFARGORPLPLLUAM 
AERV APG WDLHTPYLPRTNSRRTPHL* ♦EPHA 

GYIGALFPMSGGWPGGQ 


1197 


2547 


A 


9521 


289 


448 


lAWLSULPFPSNQANLCFLCYKLTADSRYRG 

HAMRHLTGNTSMAIRFL*ADSRFQVQRARYE 

APNWKYKYGY*IPVDMLC 


1198 


2548 


A 


9524 


204 


I 


KNKKTTKCLSIVTLNISGPNQ^NKimRVAEWI 
VKQEPmCHL^ETHFPFRDTYRLKEREQKKRlC 


1199 


2549 


A 


9546 


1785 


1943 


GGRFKESKLTNAGWQRNSFFIGPPKSIPWAA 
V*QRGDGKNPGVTHLNRPVGTX 


1200 


2550 


A 


9548 


186 


1 


VNAEKEF*KIQHYFMTKSQNKLHIEHTYLKPI 
KAIYDKWTSDIMLNLQKL*AFFLRVrVRQI 


1201 


2551 


A 


9549 


591 


2 


SSVVEFPRGPRSSIJ'rLUblrrL^u^arNW iuuu 

GSCPSGE*LVSPGSEQRKKYSNSNVIMHETSQ 

YHVQHLATFIMDKSEAITSVDDAIRKLVQLSS 

KEKJVVnTQEMIJ^Q VNDQbLRLLiJliio i^J^ 

PLPTVQRSQTVLNQLRYPSVLLLVCQDSEQSK 

PDVHFFHCDEVEAELVHEYMESALTDCRLGIC 

AMRP 


1202 


2552 


A 


9552 


428 


1 


KYGNEGHWSRQCPNPCrJU^lRP^Jfl-tJKUrtiWK 
LDCERPPQGPLPSLPELAKTSYSDLTGLATED 
♦WGPGMDAPATTIASSKTRVTLMVAGRPVFF 
U*YRATYSALPNFSGPTQSSQVSWGIDGQV 
SKPRAi rPLi* ubLH 1 r 


1203 


2553 


A 


9568 


517 


738 


RRKFERKQKQ*RYREGKQYRQRDKMKEWG 
EKEKJEUIREKGEREERKMRHRERKOESOQW) 
TMENWRVERLTEKER 


1204 


2554 


A 


9573 


83 


415 


EDKRLRLVDGDSRCAGRV^IYHDGFWGTICD 

rv^iirr\T or\ A u\/ vrVM^ I nPnVAFMATV^lAHFG 
DGWlJLbLJAHV Vvl^JSJuvjv»OY/u^w vo/vru'VJ 

EGSGPIWLDDLNCTGTESHLWQCPSRGWGQ 

HDCRHKEDAGVICSEFTALR ' 


1205 


2555 


.A 


9577 


64 


424 


ARGSCPTRPRTANGRMGETKDAPQMLVTFK 
DVAVTFFREEWRQLVLVHRTLYR*GMLETC 
T r\TT PXTKTV/pnpriWHT I YHGTOl LIVKRE 
VSHSPCAGDMRELFTREATLTPHFYNNGA 


1206 


2556 


A 


9584 


38 


476 


SNFITPSSPRLKP*TASSQRNLGQILNMFLTAV 
NPQPLSTPSWQIETKYSTKVLTGNWl^RRK 
GLPYKHLITHHQEPPHRYLISTYDDHYNRHG 
YNPGLPPLRTWNGQKLLWL 


1207 


2557 


A 


9586 


2 


412 


1 P«!<;PAAT 1 TlALtirrTVTGTALALRSRVA'l'lT^ 
PDGCRNVLRPKYYRLCDKAESWOIALETVPT 
GVAVTSWAIMLTVLTLVCKGQDYNRRQKLP 
THIIX:iX»EKGIFGLTFAraGlJDGSTGPrRFFL 
FGELFSICFS 


1208 


2558 


A 


9597 


122 


3 


KNYWPGMVAHACNPSPLGGRGRWIA'AgK 
FADAWADAW 


1209 


2559 


A 


9611 


148 


558 


KSUiNVWDIXNNTWKADRFFCHSSRTSllKK 
GDPGPTFSKMSIWTSGRTSSSYRHDEKRNIYQ 
RIRDHDLLDKRKTVTALKAGEDRAILLGLAM 
MVCSIMM*FIXGmXRSYMQSVWTRESQCT 
LLNASITETFNC 


1210 
1211 


2560 
2561 


A 

A 


9618 

9620 


384 

316 


2 

610 


SLHDMLMLAEQQQKQKWAVNTQNTAWSNA 

DSKFGQRILEKMEWSKGRGLGVQEQGGPDDI 

KVQVKNNDLGLQAHNNEANWIAHQDDFNW 

LLAELNTCQRQETADS*»*WSPKNSHVOKDS 

GELSAK 

QKHP00GQL0RSPQEDSRFHNKASSGV5RVR 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucicuiiuc 
location 
corresponding 
to last amino 

of peptide 
sequence 


Ammo acid sequence (A=Alanine OCystcine, 
D^Aspartic Acid, E^lutamic Acid, 
F=Phenylalanine» G=Glyclne, H=Histidme, 
I=Isolcucinc, K=Cysine. LpLcucine, 
M^Methionine, N=Asparaginc, P=PjT)linc, 
Q=Glutamine, R=Arginine, S=Serinc, 
T«Threonine, V=Valine. W^Tiyptophan, 
Y-Tyrosinc, X=Unknown, ♦^Stop codon. 
/B=possibl6 nucleotide deletion, \=po5sible 
nucleotide insertion 














"LGRAWWLTPVlFrLWHAKAGGSPE*D»AGRG 
GSRL^SQHFGRPRRVDHLRSAVQDQPGQHGE 
TPSlXKIQKIN*VWGRia*SSYSEAEAGESL 


1212 


2562 


A 


9623 


297 


344 


"QFPVIXjDYQmOTQLFQAQNLSLCLAMTR 
TREL*KGGGKGRHE*AWPFLKKGGYGVKAP 
AILNTSNCT^CF^FTKMLSDDPKACVFEVSSA 
DL*NTSFOVIR 


1213 


2563 


A 


9624 


2 


356 


AELSLASTACGRNTSGDSLPDYDRAFISSPLA 
T^arw SAISCLWDLPTPVLRVGLSCQPSMSSQ 
IPRMYSTDVEAAVNSLEDLYLQAYYAYLCVG 
LYFHRDDMALEGVSRFL*ELAE 


1214 


2564 


A 


9634 


776 


912 


SLSRWVRAKL*VPYNQENCLNPRGGGCSEPR 
SHYCTPAWATEKDS 


1215 


2565 


A 


9636 


220 


426 


EENFGEKLHDIGFGNGFLDKT*KAQATKAKI 
DK 


1216 


2566 


A 


9637 


391 


76 


" CFLEDGCTQAS*AEEAAVSPSMAJiEEQGSTSC 
RERRSIRFKMKNHSPDDTIKENVTISNIRTRKI 
vitn otTT'tTDxiT T ttixnj mvtut MAAFnSLVAHS 

Is HLr fc I £jKr4 LLlirlVJJUiyi I UvJUririrLr t rknu 
LFGFE.KAT 


1217 


2567 


A 


9655 


2008 


24i2 


LHCKMGALETQTHPCSQNMLRSLQKCCCKV 
EEHHLQPVQVLQTLLHSATAGTGCRRPARPP 

r» A nrrTDTDii/TJ QT3 rtQ^ilf n<!FR AS*1 KGRGRYGL 

GALGGRGGRALGGSRWPPPLPGETLFSGCKH 
RRRRRGSDAAPGEEAGT 


1218 


2568 


A 


9658 


3 


405 


"HASARALLSPNLSPNNKMAISGGPVLGFFIIA 
VLMSAQEPWAIKEEHVHQAEFYLNPDQSGEF 
MLDFEGEDTFHGDMAKKETVWRLE*LARLD 
vroi: A ni} A T A>JT A A DO A ALGIMDMOSDYTL1P 
NVPPKVTVL 


1219 


2369 


A 


9662 


3 


284 


"PDWTEKRKMQDTGSILPLHWFGFGYAALVA 
YGGIIGYVKAGSVPSLAAGLLFGSLSGLGAYQ 
LSQDPRNVWVFLATSGTIAGIMGMRFyHSG 


1220 


2570 


A 


9669 


200 


"699 


Tlltgyiqtlqnqqlsgnqqemqavdnltsa 
pgntslctrdykjtqvlfpllytvlffvglitn 

GLAMRIFFQIRSKSNFIIFLKNTVISDLLMIL'IP 
PFKTLSDAKLGTGPLRTFVCOVTSVIFYFTMYI 
SISFLGLmPRYQKTTRPFKTSNPKNLLGAKIL 


1221 


2571 


A 


9676 


164 




KERDSSTFSAAMTIMQGMEQAMPGAGPGVF 
QLGNMAVmSHLWKGLQEKFLKGEPKVLGV 
VQILTALMSLSMGITMMCMASNTYGSNPISV 
YlGYnWGSVMFnSGSLSIAAGIRTTKGLVRG 
SLGMNTTSS 


1222 


2572 


A 


9688 


43 


412 


■ ■ VAKMVKCCSAIGCASRCLPNSiaKGLTFHVl' 
PTDENIKRKWVLAMKRLDVNAAGIWEPKKG 
DVLCSRHFKKTDFDRSAPNIKLKPGVIPSFDS 
PYHLOGKREKLHCRK>nTl»KTVPATrnrNH 


1223 


2573 


A 


9696 


308 


564 


" "RTSMGILYSEPlCQAAYQNDFGQVWRWVKb 
DSSYANVQDGFNGDTPLICACRRGHVRIVSFL 
LKKECLCQPQKPERENLLALCCE 


1224 


" 2574 


A 


9700 


3 


632 


' ■ DAWASGGELGSLFDHHVQRAVCDTRAKYKt; 
GRRPRAVKVYTINLESQYLUQGVPAVGVMK 
ELVERFALYGABEQYNALDEYPAEDFTEVYLI 
KFMNLQSARTAKRKMDEQSFFGGLLHVCYA 
PEFETVEETRKKLQMRKAYVVKTTENKDHY 
VTKKKLVTEHKIOTDFRQDFHSEMSGFCKA 
ALNTSAGNSNPYLPYSGELPLCYFSSK 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


OJD\< UJ 

NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beguming 

nudeotide 

location 

copespondi 

no to first 

amino add 

residue of 

pq}tide 

sequence 


Predicted cod 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A^Alanine 0=Cystcinc, 
D=Aspartic Acid, EFGlutamic Acid, 
F=PhenylaIanine, G=Glycine, H=Histidine, 
I=Isoleudnc, K=Lysine, L=Leucine» 
M=Methionine, N=Asparagine, P=Prolinei, 
Q=01utamine, R^Arginine, S^Serine^ 
T"Threonine, V-Valinc, W=Tiyptophan, 
Y-Tyrosine, X=Unknown, ^^Stop codon, 
/^possible nucleotide deletion, \=pDSsible 
nucleotide insertion 






A 


0710 


1 




RSGCVLRMTEWETGAPAVAETPDIKLFGKWS 
TDDVHINDISLQDYIAGVRLILL 


1226 


2j7d 


A 

A 


y/.ij 






QGLPSFLPAFGPSGSWLGPAFTLGSSCNTVDT 

ICHGYSEIRPLFYLSFC3)LLLGLCWLTmLYG 

ASVANKDnCYNLQAVGQIFYISSFLYTVNYI 

WYLYTELRMKHTQSGQSTSPLVIDYTCRVCQ 

MAFVFSSLI 


1227 


2577 


A . 


9720 


3 


416 


GKWKRTQVPaGEECADMDLARKEFLRGNG 
I^UGKMNISIDLDTNYAELVLNVGRVTLQEN 
NRKKMKDCQLRKQQNENVSRAVCALLNSGG 
GVnCAEVENKGYSYKKDGIGLDLENSFSNML 
fr yriiruUrmKiriyjrt x r 


1228 


2578 


A 


9723 


278 


411 


EASSSNTVASNVADKTDPHSMNSRVFIONLN 
TLVLQKSDVEAVF 


1229 


2579 


A 


9725 


121 


902 


Lr AMbijr cNLn 1 ur jk^Ioj dlULN^aVji^o x U if 

GGSGGPYSKQYAGYDYSQQGRFVPPDMMQP 

QQPYTGQIYQPTQAYTPASPQPFYGNNEEDEP 

PLLEELGINFDHIWQKTLTVLHPLKVADGSIM 

NETDLAGPMVFCLAFGATLLLAGKIQFGYVY 

GISAIGCLGMFCLLNLMSMTGVSFGCVASVL 

GYCLLPMILLSSFAVIFSLQGMVGIILTAGnG 

WCSFSASKIFISALAMEGQQLLVAYPCALLYG 

VFALISVF 


1230 


2580 


A 


9739 


11 


247 


TFVLNMNTPKEEFQDWPIVRIAAHLPDLIVYG 
HFSPERPFMDYFDGVLMFVDISGKCKRDVGX 
MWMSNRLAWEFTCRA 


1231 


2581 


A 


9744 


37 


1100 


ACRIMPTTVDDVLEHGGEFHFFQKQMFFLLA 
LLSATFAPIYVGIVFLGFTPDHRCRSPOVAELS 
LRCGWSPAEELNYTVPGPGPAGEASPRQCRR 

RDGWVYETPGSSIVTEFNLVCANSWMLDLFQ 

SSVNVGFFIGSMSIGYIADRFGRKLCLLTTVLI 

KAAAGVU^SPTYTWMLIFRLIQOLVSKAO 

WLIGYn-ITEFVGRRYRRTVGIFYQVAYTVQL 

1 VI AGVAYALPHWRWLOFTVALPNFFFLLY 

YWOPESPRWLISQNKNAEAMRIIKHIAKKNG 

KSLPASL 


1232 


2582 


A 


9753 


164 


517 


PGPGMQGPPPITPTSWSLPPWRAYVAAAVLC 
YINLLNYMNWFIIAGVLLDIQEVFQISDNHAG 

LLQTVFVSCLLLSAPVFGYLGDRHSRKATMS 
FGILLWSGAGLSSSFISPRYSWLF 


1233 


2583 


A 


9757 


25 


419 


IJPAPWTERVRKSEGLVGTCLGDPMASPRTVT 

rVAl^VAIX51iT?VmGTIKLTPRIi!KD 

KRAYKSYVRALPIXKKMGINSILLRKSIGALE 

VACOIVMTLVPORPICDVANFFLLLLVLAVLF 

FHQLV 


1234 


2584 


A 


9765 


71 


456 


RLEU?WGFSUffLPVAYLCPLSSGFEMNVQP 

CSRCGYGVyPAEKISCIDQIWHKACFHCBVC 

KMMLSVNNFVSHQKKPYOIAHNPra^n^^ 

VYHTPLMLNVRTFPEAISGIHDQEDGEQCKSV 

FHWD 


1235 


2585 


A 


9767 


52 


559 


IRSGAMSVDKAELCGSLLTWUJTFHVPSPCA 

SPQDLSSGLAVAYVLNQIDPSWFNEAWLQGI 

SEDPGPNWKLKVTSOLLIRGQTGEEMXRDGP 

ARHMSW\a^GRKRDRCLVINHLFIHSSMEYSP 

CARPGHSARNNTDKNU>HTAIILVTSNTYTTI 

KINFQAGRSGSCL 


1236 


2586 


A 


9770 


352 


608 


FRGEALTYRFLTKRHGEYASNFESIYKKHLC 
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SEQID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQID 
NO; of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Ammn ncirl cennence fA°Aianinc O^Qysteinei 

/\Iililiu (wiu od|umvv \fv *»■•••■•••*' ^ •'•^••••j 

I>=Aspaitic Acid, BOlutamic Acid, 
F^^teiylalanine, G=Glycine,H=Histidine, 
Tslcnieucine. K=Lvsine. iJ^Leucine. 
M=Methioninc,N=Asparaginc, Pi=PioIine, 
QMjlutamine, R-Arginine, S=Serine» 
T=Threonjne, V=VaUne, W«Tiyptophan, 
Y^yiosine, X=Unknown, *^Stop codon, 
A=possible nucleotide deletion, V^possible 
nucleotide insertion 














LERKQLNLEIYDPCSQTQKAKFSLTSELHWA 
DGFVIVYDISDRSSFAFAKALI 


1237 


2387 


A . 


9793 


266 


515 


NIIjUIYFPFPRLFLLRDSQSNPKAFALTLCHH 
RYKPTRPVCXTQFQGCS 


1238 


2588 


A 


9802 


537 


967 


ELGAGRSDREAMEAAVKEEISVEDEAVDKNI 
FRDCNKIAFYRRQKQWLSKKSTYRALLDSVT 

INEETPLKPRFEVPDVLTSKPSTVRUSCSGDT 
GSLE-ADGKGDLKC 


2239 


2589 


A 


9805 


105 


"540 


VPGDPAMVRAGAVGAHLPASGLDIFGDLKK 
MNKRQLYYQYLNF AMI V biiAi^Mi w v l. i 
GSESPIVWLSGSMEPAFHRGDLLFLTNFRED 
PIRAGEIWFKVEGRDIPIVHRVIKVHEKDNG 
DIKFLTKGDNNEGDDRGSYK 


1240 


2590 


A 


9819 


3 


305 


TDGRDPLPCAARRRGGGGHCCGAGWVAiiWS 
PQPLDPAMLLWMQGFVLEAVACQDNDDYLR 
YOILFEDLDCNGDGyVDIlELQBGLRNWSSAF 
DPNSEEHG 


1241 


2591 


A 


9834 


841 


1209 


SPARGKSNRTDVMITAPKNKKMTENLAAPEA 
LDSSTHSSSTATQSRAKMNTPAPTTSTVPAPR 
GGSGGPPPCAPHDRVS S VLQCIXrQ AMDHJs. I n 
SSHSWEFLFKRTKTPSPFHPAVRENRN 


1242 


2592 


A 


9843 


3. 


589 


TISCGPATEPPASLLSSASSDDFCKEKTEDRYS 

LGSSLDSGMRTPLCRICFQGPEQGELLSPCRC 

DGSVKCTHQPCLIKWrSERGCWSCELCYYKY 

HVIAISTKNPLQWQAISLTVIEKVQVAAAILGS 

LFLIASIS wLIWSTFSrSAKW l^Kv^ULLr Viiu i u 

MYGFMDVMP/AVDSEDMVQAAKEVOKRWS 

DIPP 


1243 


2593 


A 


9846 


198 


411 


WiaSHHAGKMPVMKGLLAPQNTFLDTIAW 
DGTHSNFILANAQVAKGFPI VYCSDGFCELAG 

FARTEVMQ 


1244 


2594 


A 


9848 


116 


650 


PICGFLYLCSAMASESSPLLAYRLLGEEGVAL 

PANGAGGPGGASARKLSTFLGVWPTVLSMF 

SIVVFLRIGFVVGHAGLLQALAMLLVAYFILA 

LTVLSVCAIATNGAVQGGGAYCILQHRWTG 

VWPVLPAREVMISRTLGPEVGGSIGLMFYLA 

NVCGCAVSLLGLVESVLDVFGA 


1243 


2595 


A 


9849 


573 


1620 


KSKCRFPEGLSburOrMJtRJiAJLrODVJO v yiiAii 

AMLDEPQEQAEOSLTVYVISEHSSLLPQDMM 

SYIGPKRTAWRGIMHREAFNnGRRJVQVAQ 

AMSLTEDVLAAALADHLPEDKWSAEKRRPL 

vocT rivcrrpci T MPriPT^<5WnVYWDIEGAVRR 

YVQPFLNALGAAGNFSVDSQILYYAMLGVNP 

RFDSASSSYYLDMHSLPHVINPVESRLGSSAA 

SLYPVLNFLL YVPELAHSPLYIQDKDGAPVAT 

NAFHSPRWGGIMVYNVDSKTYNASVLPVRV 

EVDMVRVMEVFLAQLRLLFGUQPQLPPKCL 

I^GPTSEGlMTWELDRaWARSVENLATATT 

TL^TSL-A 


1246 


2596 


A 


9850 


114 


464 


PPQLGAQRVREPRHPDVRAPLRVTSPGLRbKb 
ARSLGRRPRIAMVTVGNYCEAEGPVGPAWM 
QDGI^PCFFFTLVPSTRMALGUALVL/kLPCK 
RRERPAGADSLSWGAGPRJSSYV 


1247 


2597 


A 


9851 


2 


327 


FVRNKKMTRSCSAVGCSTRDTVLSRliRCJUS^ 
HQFPTDTIQRSKWIRAVNRVDPRSKKIWIPGP 
GAILCSKHFQESDFESYGIRRKLKKGAVPSVS 
LYKVFKYSSRCTS 
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SEQ ID 
NO: of 
nucl- 
eotide 
scq. 
uence 


SEQ ID 
KG: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 

ID 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

conespondi 

ng to first 

amino acid 

residue of 

pq)tide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A«AIanine OCysteinc, 
I>Aspartic Acid, E^OIutamic Acid, 
F=Phenylalanine, GNjlycine. H=Histidin^ 
I=lsoieucme, K^Lysme, L-L.eucine, 
M=Methionine, N=Asparaginc, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V-Valine, W^Tiyptophan, 
Y«Tyrosine, X^Unknown. ♦=Stop codon, 
/^possible nucleotide deletion, \Fpossibl6 
nucleotide insertion 


1248 


259S 


A 


9853 


58 


444 


RVDDFVYSKGGKDAGGADVSLACRRQSIPEE 
FROrrWEUKKEGSTLGLTISGGTDKDGKPR 
VSNLRPGGLAARSDLLNIGDYIRSVNGIHLTR 
LRHDEIITLLKNVOERWLEVEYELPPPGGCF 


1249 


2599 


A 


9856 


2 


1265 


. LPPPRPSRHRRGRAGTRASAAAAAGPTVSAV 
RAPVRGQDSGAGTPQGRLAGRGAHLSRVGA 
SGSGVAAGPAARHAPRRRCADAGEAVGASC 
GRCAVALLSOVCTLVSTHVCVGSGCPGAAGT 
PMGAGDAGASAESAVTTAFQEPPARPLQAGS 
GAGPAPGMMRSTTLLALLALVLLYLVSGAL 
VFRALEQPHEQQAQREL'GEVREKPLRAHPCV 
SDQELGLUKEVADALGGGADPETNSTSNSSH 
SAWDLGSAFFFSGTmriGGGGDWHYGGGK 
ELPHGORCRETEOSQVAPRLPASPLCPOYGN 
VALRTDAGRLFCIFYALVGIPLFGILLAGVGD 
RLGSSLRHGIGHIEAIFLKWHVPPELVRVLSA 
MLFLLIGCLLFVLTPTFVFCYMEDWSKLEAIY 
FNOVTLTTVGFGDYVA 


1250 


2600 


A 


9873 


2 


652 


fwpspcggipgrapngasrptmgnsasrndf 

ewvytdqphtqrrkeilakypaikalmrpdp 

rlkwavlvlvlvqmlacwlvrglawrwll 

fwayafggcvnhsltlaihdishnaafgtgr 

aarnrwlavfanlpegvpyaasfkkyhvdh 

hrylx3gdgldvdvptrlegwffctparkll 

wlvlqpffyslrplcvhpkavtomevlntlv 

QLA 


1251 


2601 


A 


9875 


150 


1209 


pvimplhfspgdivrpsccvssspklrrnahsk 
lesyrpdtdlsredtgcnlqhisdreniddln 
mefnpsdhprastiflsksqtdvrekrkslfin 
hhppgqiarkysscstiflddstvsqpnlkyti 
kcvalaiyyhiknrdpdormlldifdenlhpl 
sksevppdydkhnpeqkqiyrfvrtlfsaaql 
taecaivtlvylerlltyaeidicpanwkriv 

LGAILLASKVWDDQAVWNVDYCQILKDITVE 

dmnelerqflellqfninvpssvyakyyfdl 

rslaeannlsfpleplsrerahkleaisrlced 

kykdlrrsarkrsasadnltlprwspaiis 


1252 


2602 


A 


9879 


6 


376 


KRPDSRPFAQyRAlTr J KrKl KvyL.rvi il > i w j 

kavgncmgslstanvefcxdvfkelnsnnig 
dniffsslsllyalsmvlloargeteeqlekv 
wnssevcseprslscsrsgsaklilslyq 


1253 


2603 


A 


9880 


180 


388 


KEQAELL YGL YCQCDLTLSSHrSa vrAMDou 

nfthatfvligipglekahfwvgfpllsmyva 
amfgnc 


1254 


2604 


A 


9881 


19 . 


494 


VISFQnTDTIMDSSTAHSPVFLVFPPEn'ASEYb 

stelsattfstqsplqklfarkmkilgtiqilf 
gimtfsfgvirftllkpyprfpfiflsaypfwg 
svlfmsgafliavkrkttetliilsrimnflsa 
lgaiagiilltfefhprsklhl 


1255 


2605 


A 


9896 


72 


38^ 


' rpgreqrdcfqapplglggrqtdmmhhpl.i 
gatcvglpnvgmcpqlsgaltfmylqqgnq 
eatvapdtmaqpyasaqfappqngipqeyta 
phphpapeytgqtt 


1256 
1257 


2606 
2607 


A 

A 


9902 
9905 


95 
374 


399 
459 


■ sggpagllhrpvlpkmglsgllpilvp1*1llu 
diqepghaegilgkpcpkikveceveeidqctk 
prdcpenmkccpfsrgkkcldfrkvsltlyh 
keele 

■ ■ ehijcstpnrlgvvahtcnpstlggrggw 
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seq- 
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NO: of 
peptide 
seqr 
ucnce 


Met 
liod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sec^uence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence. 


Amino acid sequence (A'^Alaninc OCysteine, 
D^Aspartic Acid, EKSlutamic Acid. 
F=Phenylalanine, G=Glycine, H=Histidine, 
l=»IsoIeucine. K^Lysine, LpLeucine, 
M=Methionine,N=Asparagine, P=Prolinc, 
Q=GIutaroine, R=Arginine, &=Sertne, 
T^Ttireonine, V^Valine, W«Tiyptopban, 
Y«Tyrosine, X=Unknown, *«Stop codon, 
possible nucleotide deletion, V=possible 
nucleotide insertion 


1258 


260S 


A 


9911 


364 


1974 


AGPGVPAVGGRWASGPGLGGRTLCSGPPDH 

QRRGPSCGASGDPQCVGSPHPQRARPLLARP 

GARLLPGHLPSPRPPRLPTGQPPAAAFRGPVR 

PQGGOHIHPLPTPGGRPCFAVSEGSGSALLLS 

YLGECGSSSYVTGAAaSPVLRCREWFEAGLP 

WPYERGFLLHQKIALSRYATALHDTVDTSRL 

FRSRSLREFEEALFCHTKirPioWJLJAi wuKWi/ 

PLRDVDEAAVPVLCICSADDPVCGPPDHTLTT 

ELFHSNPYFFLLLSRHGOHCGFLRQEPLPAWS 

HEN^ESFRALTEFFRTEERIKuLoKiiKA&rLU 

GRRRGGALQRREVSSSSNLEEEFNWKRSyrRL 

MAAAAGAAAAPGSREPQDRPECGAGHPGPR 

YYRHPERWLLRPEAFLGPLRTRAPSAEDSQR 

ERPAARSGPEMRVRYPWAAVLAPYLALSQD 

PMVKSSASGQGASOSYNHVREEMLIKAGGA 

MSRRVVRQSKFRHVFGQAAKADQAYEDIRV 

SKXni^SSFCAVNPKFLAIIVEAuuuuAri VL 

FLAK 


1259 


"2609 


A 


9919 


693 


935 


GOTCFIGESTCCWIFPSSVITQCWAKAPRAA 
TLSKAERLRSQPGPEQGOSSYRPRTPTAAAIL 
PPRPGRSHRKRKLVSTK 


1260 


2610 


A 


9921 


455 


1082 


QRSCLCSAIEKDGGDVKALYRRSQALEKLGR 

LDQAVLDLQRCVSLEPKNKVFQEALRNIGGQ 

lOEKVRYMSSTDAKVEQMFQILLDPEEKOTE 

KKQKASQNLVVLAREDAGAEKIFRSNCjVgLtr 

QRLLDMGETDLMLAALRTLVGICSEHQSRTV 

ATLSILGTRRWSILGVESQAVSLAACHLLQV 

MFD ALKEGVKKur KURJiuAU v 


1261 


2611 


A 


9928 


1 


438 


GFRGAEAPGAAQAPKKKKPRPTEGGFGAUSCi 
RGKDPYRGPTLLHQPKPPKDEFLSSLESYEIAF 
PTRVDHNGALLAr ot^rr^i^^KVdJcwAj i ua. a riao 
RLFYKEASPSTHFIXNLTRSSRLLAC3HVSVEY 
WTREGLAWORADRPHCLYA 


1262 


2612 


A 


9931 


168 


435 


AAEMGRAGAAAVBPGLALLWAVGLGGPPPA 
PPRLPFCLQELQGRHALHTFSLERTCSYQDFL 
WADEGRLLHVGAQDLATWHTLSPLGLW 


1263 


2613 


A 


9938 


247 


488 


RMSATSVDQIu^J^viv^vJiNJv V o v yiNuoinv'"-^ 
CNDDDFEPYLRSPDNQSNSYPPMSDPYMPGY 

YAFSIGrr I o L»uiiAA w ai^i- 


1264 


2614 


A 


9941 


61 


277 


' ESIGLTALGPRRRPWEHRWSDPITLKMKUWU 
WLALLLGALLGTAWARRSQDLHCGACKAVR 
RRVROFNIYDY 


1265 


2615 


A 


9956 


2 


522 


■ FVASEVSKMPVPASWPHPPGPFIXLTLLLGLT 
EVAGEEELQMIQPEKIXLVTVGKTATLHCTV 
TSLLPVGPVLWFRGVGPGRELIYNQKEGHFP 
D\rrTV<!nT TiniNNMnFSIRISSITPADVGTYY 
CVKFRKOSPDHVEFKSOAGTELSVRGEYSVO 
FLSOVWWLSSHPFMN . 


1266 


2616 


A 


10002 


243 


387 


■ PKNNA(MXFTAVCQPRCKHUlSUlU*mCI^C 
HPGYAGKTCNOGRKTV 


1267 


2617 


A 


10004 


36 


707 


LPAPASTWSVARETMASSSVPPATVSAATAG 

PGPGFGFASKTKKKHFVQQKVKVFRAADPLV 

GVFLWGVAHSINELSQVPPPVMLLPDDFKAS 

SKIKVNNHLFHRBNLPSHFKFKEYCPQVFRNL 

RDRFGIDDQDYLVSLTKNPPSESEGSDGRFLIS 

YDRTLVDCEVSSEDIADMHSNLSNYHQVRPLS 

RPILSLSSLLTYSSAIVSNRCOLORKHGRENP 


1268 


2618 


A 


10005 


2 


209 ' 


" GEGYELFVPSNGVPAVCHMVGRRPHRAVL5P 
SODELBHSLGESAAQGAAGWLWVSWENTR 
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SEQ ID £ 
NO: of I 
nucl- t 
eotide s 
scq- » 
uence 


5EQID ^ 
^0: of ^ 
)eptide 
seq- 
uence 


^et S 
tod I 
i 
1 
( 


;eq I 

DNO: I 
n I 
JSSN 1 
)9/496 i 
n4 1 


Predicted I 
)eginning i 
mcleotide 
ocation 
jorrespondi 
ag to first 
amino acid 
residue of 
peptide 
sequence 


teiictcd end i 
lucleotide 
ocation 
rarresponding 
to last amino 
acid residue 
of peptide 
sequence 


^ino acid sequence (A^Alanine OOysteine, 
>=Aspartic Acid, E=Glutam!C Acid. 
F=Phcnylalanine, G«Glycine, H=Histidine, 
[=Isoleucine, K=Lysine, D=Leucine, 

K>f~>iiai-Viinn!np N=Acnftr9?ine. I^Pfoline* 
iVl^IVieuiJUiiiJiCj M~~^\i|/c»*«g*i''-» * * 

QOIutamine, R=Arginine, S^Serine, 
T«Thrconinc, V«Valine, W^Tryptophan, 
Y=^yrosine, X=Unknown, ♦=Stop codon, 
A=possiblc nucleotide deletion, \Fpossible 

nucleotide insertion ^ 

TKVSLGLA 


1269 


2619 


A 


10010 


245 


688 


FOMLKNKGHSSlUUJNLAVXNAV/u-v^i^mijruj' 
LQLRNl^ADHSKTQVQKm^JKSLKKDTKAI 

-ri-«'T</^T trvT^nnnvJ cnCCVTTYVT "nPKPPPLTL 

AQKLGLIGPPPPPLSSDEWEKVKQRSLLQGDS 
VQPCPICKEEFELRPQVFSIRG 


IOTA 

l27U 




A 


lOOll 


2 


588 


RVDDFVRPLPPGLMSRSRASIHRGSIPAMSYA 
PFRDVRGPSTHRTQ YVHSrYDRPu w wr kt v-u 
SGNQLLMLDEDEIHPLLIRDRRSESSRNKLLR 
RTVSVPVEGRPHGEHEYHLGRSRRKSVPGGK 
QYSMEGAPAAPFRPSQGFLSRRLKSSIKRTKS 
QPKLDRTSSFRQILPRFRSADHDRYRGWSMW 

DEIDV 


1271 


2621 


A 


10013 


209 


363 


LPAPPNLSPRLSFGFQFPGGNDNYLTITGFSHr 
FLSGAEVSOSCRRRGGRA 


1272 


2622 


A 


10014 


7 


388 


SAVTISWKWRSVMGIQI^PALLASLGAOLV i 
LLGLAVGSYLVRRSRRPQVTLLDFNEKDLLR 
LIDKTLSARSPCKHEYLSTRIDGSLSIRPYrPVT 

sdedogyvdidikvyucgvhptfpeggkmsh 


1273 


2623 


A 


10016 


1 


1339 


MAARTLGRGVGRLLGSLRGLSGQPARPPUiV 

saprraasgpsgsapavaaaaaqpgsypals 

AQAAREPAAFWGPLARDTLVWDTPYHTVW 

DCDFSTGKIGWFLGGQLNVSVNCLDQHVRKS 

PESVALlWERDEPGTEVRITYRELLETrCRLA 

ntlkrhgvhrgdrvaiympvsplavaamla 

CARIGAVHTVIFAGFSAESLAGRINDAKCKW 

ITFNQGLRGORWELKKIVDEAVKHCPrVQH 

VLVAHRTDNKVHMGDLDVPLEQEMAKEDP 

VCAPESMGSEDMLFMLYTSGSTOMPKGIVHT 

QAOYLLYAALTHKLVFDHQPGDIFGCVADIG 

witghsywygplcngatsvlfestpvyfna 
grywetverlkinqfygaptavrlllkygd . 
awvkkydrsslrtlgsvgepinceawewlh 

RWGDSRCTLVDTWWQT 


1274 


2624 


A 


10017 


1 


3750 


"FRPQGTPRSPASHVLTMSAPDEORRDFFM'KU 
KTLGSFFGSLPGFSSARNLVANAHSSARARPA 
ADPTGAPAAEAAQPQAQVAAHPEQTAPWTE 
KELQPSEKMVSGAKDLVCSKMSRAKDAVSS 
GVASWDVAKGWQGGLDTTRSALTGTKEV 
VSSGVTGAMDMAKGAVQGGLDTSKAVLTG 

tkdtvstgltgavnvakgtvqagvdttktv 

ltgtkdtvttgvmgavnlakgtvqtgvets 

kavltctkdavstgltgavnvargsiqtgv 

dtsktvltctkdtvcsgvtgamnvakgtiqt 

gvdtsktvltgtkdtvcsgvtgamnvakgt 

iotgvdtsktvltgtkdtvcsgvtgamnva 

kgticttgvdtotvltgtkntvcsgvtgavn 

lakeaiqggldttksmvmgtkdtmstgltg 

aanvakgamqtglnttqniatgtkdtv^g 

vtgamnlargtiqtgvdtikivltgtkdtvc 

sgvtgaanvakgavqggldttksvltgtkd 

AVSTGLTGAVNVAKGTVQTGVDrrKTVLTG 
TKDTVCSGVTSAYNVAKGAVQGGLDTrKSV 
VIGTKDTMSTGLTGAAKVAKGAVQTGVDTA 

ktvltgtkdtvttglvgavnvakgtvqtgm 

DmTVLTGTKDTIYSGVTSAVNVAKGAVQT 
GLmQNlATGTKNTFGSGVTSAVNVAKGAA 
OTGVDTAKmTGTKDTVTTGUVlOAVNVAK 

gwc^vdttktvltotcdtvcsgvtqaan 
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SEQID i 
NO: of T 
nucl- T 
eotide s 
seq- \ 
ueace 


EQDD A 
40: of h 

»C(J- 

icnce 


rlet S 
od I 
i 

i 


JEQ I 
DNO: \ 
n I 
JSSN 1 
)9/496 < 
?14 1 


'redicted 1 I 
)eginning t 
lucleotide 1 
ocation < 
xmespondi t 
i\g to first i 
amino acid < 
residue of 
peptide 
sequence 1 


'redicted end i 
lucleotide 
ocation 
x)JTesponding 
0 last amino 
icid residue 
3f peptide 
sequence 


iVmino acid sequence {A=Aianine <j=cysiemc, 
OAspaitic Acid, EKilutamic Add, 
F«Phenylaianinc G=Glycinc, H»Histidinc, 
l=lsolcucinc, K=Lysine, I>Leucine, 
M=Methioninc, N=AsparagiDe, P==Pioline, 
Q=Glutaminc, R«Argtaine, S»Serine> 
T=Threoninc, V=Valine, W=Tiyptophan, 
Y=Tyrosine, X^'Unknown, ♦=Stop codon, 
/^possible nucleotide deletion, \ppossible 

nucleotide insertion . 

VAKGAIQGGLDTTKSVLTGTKDAVSTOLTGA 

VKLAKGTVQTGMDTTKTVLTGTKDAVCSGV 

TGAANVAKGAVQMGVDTAKTVLTGTKDTV 

CSGVTGAANVAKGAVQTGLKTTQNIATGTK 

NTLGSGVTGAAKVAKGAVQOOLDTTKSVLT 

nTK-nAVSTGLTQAVNLAKGTVQTGVDTSKT 

VLTGTKDTVCSGVTGAVNVAKGTVQTGVDT 

AKTVLSGAKDAVTTGVTGAVNVAKGTVQTG 

VDASKAVLMGTKDTVFSGVTGAMSMAKGA 

\inrn{ nTTKTVLTGTKDAVSAGLMGSGNVA 

TGATHTGLSTFQNWLPSTPATSWOGLTSSRT 

TDNGGEQTALSPQEAPFSGISTPPDVLSVGPEP 

AWEAAATTKGLATDVATFTQGAAPGREDTG 

LLATTHGPEEAPRLAMLQNELEGLGDFHPM 

NAEEQAQLAASQPGPKVLSAEQGSYFVRLGD 

T rpQPPnPAFEHAVSHLOHGOFQARDTLAQL 














1275 


2625 


A 


10025 


124 


415 


TIIJUUCKEKTCPCKKEIGWSISRSGMY^ 

YKRKYSAANTKVEKKKKEKVLAPVTKPVGG 

DKNGGTRWKLPTNffRYYPTEDVPRKLLSHG 

KKPFS 


1276 


2626 


A 


10030 


3 


507 


■qgslrfspprvpscsrvfcpvppggcglfsi'ms 

ASRPQSPrrPWCLPRRYMKHKRDDGPEKQH) 
EAVDVTPVMTCVFVVMCXSMLVIXryTyDL 
T \/vwTrtTFrLASATGLYSCLAPCVRRLPFGK 
CRIPNNSLPYFHKRPQARMLLLALFCVAVSV 

VWGVFRNEDO 


1277 


2627 


A 


10035 


51 


"IS 


"YSRFTVPLPATMASSEVARHLLFQSHMA'lKl 
TCMSSQGSDDEQIKRENIRSLTMSGHVGFESL 
pnnt vNR^ilOOGFCFNILCVGETGlGKSTLIDT 
LF^^OTEDYESSHFCPNVKLKAQTYELQESN 
VOLKLnVNTVGFGDQINKEERQLORSQSTEN 
POKYRSEQHPVEPKKCTSFWKGALGKWAGIE 
SSGQSAQQPYLPINSPPHRLADVADVHLFSSV 
LSGAFGCYHLDVTVNEFKKQQNRDEQEGYS 

■ -^^SKSLRPeCPRbttiAGCLTVSUPSl 


1278 


2628 


A 


10036 


3 


457 


VMGAVGESLSVQCRYEEKYKTFNKYWCRQP 
CLPIWHHvlVETGGSEGVVRSDQVinDHPGDL 
TFTVTLENLTADDAGKYRCGIAmQEDGLSG 
PI pnpPFnvQVT .VSSASSTENSVKTP 


1279 


2629 


A 


" 10039 


214 


1 435 


■ NDSLVPMSSWKSCARAPSSKSAWRRSAAIKK 
SRKCLRTKRKRWSSOKGTQMQSTLSETPRRA 
OMPCMWWYPFWG 


1280 


2630 


A 


10043 


2 


344 


■ "RATWHNAGKEREAVQLMAGAEKRVKASHS 
FLRGLFGGNTRIEEACEMYTRAANMFKMAK 
NWSAAGNAFCX3AAKLHMQLQSKHDSATSFV 
nAGNAYKKADPOGKTARHVACYLCV 

- - rtQQT PQVK 1 V \!\t\ kFTRSHFLPLMKWTO 


1281 


2631 


A 


10080 


" 620 


818 


V lYivi J jLr oiriirrirj&i i^n* tat^MM.^ t *'« 
PTMAHCSLKILASRNSADSAFLSAGDTSLSHST 


1282 


2632 


A 


10084 


3 


1640 


" 'SASlURGDKRASGEVGlAPSSRHlLlUKlTSAky 
NGTAIISLVRGPGILGEVTVFWRIFPPSVGEFA 
ETSGKLTMRDEQSAVIWIQALNDDIPEEKSF 
YEFQLTAVSEGGVLSESSSTANnWASDSPY 
GRFAFSHEQLRVSEAQRVNITCIRSSGDFGHVR 
LWYKTMSGTAEAGLDFVPAAGELLFEAGEM 
RKSLHVEILDDDYPEGPEEFSLTITKVELQGR 
GYDFTIQENGlXjmQPPEIGNISIVRinMKNDN 
AEGIIEFDPKYTAFEVEEDVGLIMIPWRLHGT 
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SEQID S 
NO: of 1 
nucl- 1 
eotide J 
seq- 1 
uence 


JEQID ^ 
TO: of ii 
xptide 
jeq- 
ience 


^ct i 
Od ] 
i 
1 


m 1 

DNO: 1 
n 1 
JSSN 
39/496 
914 


=>redicted ] 
beginning i 
ludeotide 
ocation 
conespondi 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
lucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alaninc COysteine, 
[>Aspartic Acid, &=OIutamic Acid. 
F=Phenylalanine, G=Glycine, H-Histidine, 
Nlsoleuciue, K=Lysine, L=Lcucine, 
S4=Methicjnine, N«Asparagine, P=Proline, 
Q=Glutamine, R^Argkiine, S«Serine, 
TBsThrconine V^Valine. W^Tryptophan, 
Y=^rosine, X=Unlcnown. ♦=Stop codon, 
/=possible nucleotide deletion, V=posslble 
nucleotide uisertion 














YGYVTADFISQSSSASPGGVDYILHGSTV lyg 

HGQNLSFINISUDDNESEREEPIEILLTGATGG 

AVLGRHLVSRHIAKSDSPFGVIRFLNQSKISIA 

NPNSTMILSLVLERTGGLLGEIQVNWETVGPN 

SQEALLPQNRDIADPVSGLFYFGEGEGGVRTn 

LTIYPHEEIEVEETFIIKLHLVKGEAKLDSRAK 

ALEGPLLITFFVRRVKGTFOEIM 


1283 


2633 


A 


10088 


316 


516 


MGSKTLPAPVPIHPSLQL™ySFLQAVNGLFr 
VPSDHLFNLYGFSALHAVHLHQWTLGYPAM 

HLXRS 


1284 


2634 


A 


10091 


2 


569 


FVSPSRAMASALl Y vaiu-KSFVlLV V 1 Yl^iAAjr 

LVILMPAKFVRCAYVnLMAIYWCTEVIPLAV 

TSLMPVLLFPLFQILDSRQVCVQYMKDTNML 

FLGGLIVAVAVERWNLHKRIALRTLLWVGA 

KPARLMLGFMGVTALLSMWISNTATTAMMV 

PIVEAILQQMEATSAATEAGLELVDKOKAKE 

LP 


1283 


2635 


A 


10092 


290 


728 


KQSTRPDVMTLYFLHWgbtlMSuEb V V bbA V Y 
AAATRTTSFKGTSPSSKYVKLNVGGALYYTT 
MQTLTK.QDTMLKAMFSGRMb VL 1 Uatu w u- 
IDRCGKHFGTILNYLRDGAVPLPESRREIEELL 
AFAKYYLVOGLVEECQAALOV 


1286 


2636 


A 


10100 


1 


574 


"RPRGRGAWAGPGGDYSGVRRQQRRRTRISUS 
QRGSDAAGTMGCCTGRCSLICLCALQLVSAL 
ERQIFDFLGFQWAPILGNFLHUWILGLFGTIQ 
YI^RYIMVYTV WTAL W V l win vr in.*r i i-e- ▼ ^ 
GLSKDTDLMTFNISVHRSWWREHGPGCVRR 
VLPPSAHGMMDDYTYVSVTGCIVDFQYLEVI 


1287 


2637 


A 


10103 


252 


376 


RSRMGDKPIWEQIGSSFIQHY YQLFDNDRl 
GAIYVSFOL 


1288 


2638 


A 


10107 


1 


478 


MEEEDESRGKTEESGEDRGDGPPDRDl'l Lbf b 
AFILRAIQQAVGSSLQGDLPNDKDGSRCHGL 
RWRRCRSPRSEPRSQESGGTDTATVLDMATD 
oT>T A /-»T \7c\n r^DPnTWVP^STll DI RPGESEDM 
LELVAEVRIGDRDPIPLPVPSLLPRLRAWRTG 

KT 


1289 


2639 


A 


10113 


237 


438 


LLSRMPSTNRAGSLKDPElAELFFKEDPblU.Ki 
DLREIGHGSFGAAYFARDVRTNEVVAIKKMS 

YSG 


1290 


2640 


A 


" 10114 


367 


856 


■ -RGAKAKSAVLPPGPPCSSlLlLSfPAPLTPRSt'U 
TEATRFTAMSKSLKKKSHWTSKVHESVIGRN 
PEGQLGFELKGGAENGQFPYLGEVKPGKVAY 
ESGSKLVSEELLLEVNETPVAGLTIRDVLAVI 
KHCKDPLRLKCVKQGESSGLLSNO-PGGGTAR 

GAGO 


1291 


2641 


A 


10116 


128 


591 


■ "RTIRETERRSALSCSVLKSKPLPGLQFgAS^v^K 
RRRLPGRRQVQVQEGGGSGLRAWVLAMASV 
LGSGRGSGGLSSQLKCKSKRRRRRRSKRKDK 
VSD-STFlj\PFKHLSPGITNTEDDDTLSTSSAE 
VKENRNVGNLAARPPPSGDRARGGATR 


1292 


2642 


A 


10121 


1 


749 


■ QRRRFRAGLWGGHGLTUOLRRNGUCUUiiAK 
VPRVGERLRGHRCPDPLCLLLDMLFLSFHAG 
SWESWCCCCLIPADRPWDRGQHWQLEMADT 
RSVHETRFEAAVKVIQSLPKNGSFQFTNEMM 
LKFYSFYKQATEGPCKLSRPGFWDPIGRYKW 
DAWSSLGDMIKEEAMJAYVEEMKKIIETMP 
KTTEKVEELLRVIGPFYEIVEDKKSGRSSDrrSD 
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SEQID j ; 
NO: of ] 
nucl- 1 
cotide 
seq- 
uence 


SEQID 1 
[40: of 
peptide 
seq- 
uence 


Met : 
fiod ] 


3EQ 1 

DDNO: 

in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

ocation 

correspondi 

hg to first 

amino add 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence ^^/v='/>^au"**' >.^^-vrjr»iwi*i%> 
D=Aspartic Acid, EOlutamic Acid, 
F=Phenylalamne, G=Glycine» H=Histidine, 
I=IsoIcucine, K=Lysine, L=4-eucine, 
M=Methionine, N=Asparagine, P=ProUne, 
Q=Glutamine, R«Arginine, S=Serine4 
T=Thrconine, V-Valinc. W«=Tiyptophan, 
Y^Tyrosine, X»Unknown, ♦=StDp codon, 
/^possible nucleotide deletion, V=possible 
nucleotide insertion 














LGNVLTSTrt^AkTVNGKAEStoSGAESEEEB 
AC 


1293 


2643 


A 


10124 


2 


989 


PLMSLVRVVEFVAASSAQKTPSRLENYYMVC 

KADEKFNQLVHFLRNHKQEKHLVFFRYSSGL 

CGRGIRDSARMCSTCACVEYYGKALEVLVK 

GVKIMCfflGKMKYKJRNKIFMEFRKLQSGILV 

CroVMARGIDIPEVNWVLQYDPPSNASAFVH 

RCGRTARIGHGGSALVFLLPMEESYINFLAIN 

QKCPLQEMKPQRNTADLLPKLKSMALADRA 

VFEKGMKAFVSYVQAYAKHECNLIFRLKDL 

DFASLARGFALLRMPKMPELRGKQFPDFVPV 

DVKTDTPFKDKIREKQRQKLLEQQRREK'raN 

EGRRKFDCNKAWSKQKAKKK 


1294 


2644 


A 


10129 


91 


1042 


" VTMYKDCIESTGDYFLLCDAEGPWGnLESLA 

ilgivvtiuxiju^fu«rkjqdcsqwn\^^^ 

LLFLLSVLGLFGLAFAFIIELNQQTAPVRYFLF 

GVLFALCFSCLLAHASNLVKLVRGCVSFSWT 

mClAIGCSLLQinATEYmiMTRGMMFVN 

MTPCQL>IVDFVVLLVYVLFLIv1ALTFFVSKAT 

FCGPCENWKQHGRLIFITVLFSIIIWYVWlbMi- 

LRGNPQFQRQPQWDDPWCIALVTNAWVFL 

LLYIVPELCfl-YRSCRQECPLQGNACPVTAYQ 

HSFOVENOELSRDKWKVLLNSDFLSHSGA 


1295 


2645 


A 


10133 


376 


518 


RPRVYTHNSQWCFLPQDHl'UWLKiQbUAl'O 
nRGAPROEGPGSSWRQV 


1296 


2646 


A 


10135 


3 


551 


EWSLDPFMGIMSGQVGDLSPSQEKSLAQhKli 

niodvlsalpnpddyfllrwlqarsfdlqb:s 

EDMLRKHMEFRKQQDLANILAWQPrbVVKL 
YNANGICGHDGEGSPVWYHIVGSQDPKGLLL 
SASKQELLRDSFRSCELLLRECELQSQKLGKR 
VRKHAIFGLEGLGLRDLWKPGIELLQE 




2647 


A 


10138 


48 


407 


"MVSSCCGSVCSDQGCGQDLCQETCCRPSUJb 
TTCCRTTCCRPSCCVSSCClu'QCCVia v<-,uv^r i 

csrpsccqttccrttcyrpsccvssccrpqcx: 

QPVCCOPTCCRPSCCETTCCHPXCC 




1 2648 


A 


10156 


94 


453 


" GGNRKSAEMFSQVPRTPASGCYYLNSM'll'bU 
OEMYLRFDQTTRRSPYRMbKll-AKitv^i^v irss 
QQEIEAKEACDWLRAAGFPQYAQLYEDSQFP 
TNIVAVKNDHDFLEKDLGEPLCRRLNT 


1299 


2649 


A 


10161 


1 


393 


■ PRFSELVDORGRVSARFGGSPSKAATVRSgn 
ASAQLENMEBAPKRVSLALQLPEHGSKDIGN 
VPGNCSENPCQNGGTCVPGADAHSCDCGPGF 
KGRRCEIACKVSiaKniaJ^EIKAFPVWEGG 

VCHHV 


1300 


26S0 


A 


10162 


98 


391 


■ AKIASLERIMPANYTCTRPDGDNTDFRY1''1YA 
VTYTGILGPGLIG>niJ\LWVF^rGYMKETKRA 
VIFMINLAIADLLQVI^U'UUFYYIJCHDWPF 

VPV 


i3or 


2651 


A 


10165 


" 1 


7545 


■ PGIRVGITSQTGLSSNLQKNCSKLAFlSSliUib 
KQLQCMPMEGRGRASSSISDLQGKGFEKGTG 

ekhvpgvgsarhspqasaggspwqrgkaqt 

rwlgkpdpgrkrrrgspqeegglrvsaaar 

llcsganrckvlvrqnstpntqqpavhpstp 

psrplpqagrclvaplrphpdwvaaktlaka 

lrapgkpwrlaapsplgdlgapglpgpstap 

rtlsveepgvecnqlclyadvtdpvlcloqk 

dpgvegkhcekekissskelkhvhaksepskp 

arrlseslhwdenkneskierehkrrtstpv 

iMEGVQEETOTRDVKRQVERSEICrEEPQKQ 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

0SSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alaninc OCysteine, 
l>Aspartic Acid, E=Glutamic Acid, 
F«PhenylaIanine, G^Glycine, H=Histidinc, 
I^lsoleucine, K^Lysine, 0=Leucine, 
M=Methioninc, N=Asparagine, I^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T«Threonine, V^Valinc, W'=Tryptophan, 
Y^Tyrosine. X«Unknown, ♦=Stop codon, 
/^possible nucleotide deletion, V^ssiblc 
nucleotide insertion 














KSTLKNEKHLKKDDSETPHLKSLLKKEVKSS 

KEKPEREKTPSEDKLSVKHKYKGDCMHKTG 

DETELHSSEKGLKVEENIQKQSQQTKLSSDDK 

TERKSKHRNERKLSVLGKDGKPVSEYIIKTDE 

NVRKENNKJCERRLSAEKTKAEHKSRRSSDSK 

IQKDSLGSKQHGITLQRRSESYSEDKCDMDST 

NMDSNLKPEEVVHKEKRRTKSLLEEKLVLKS 

KSKTQGKQVKWETELQEGATKQATTPKPD 

KEKNTEENDSEKQRKSKVEDKPFEETGVEPV 

LETASSSAHSTQKDSSHRAKLPLAKEKYKSD 

KDSTSTRLERKLSDGHKSRSLKHSSKDIKKKD 

ENKSDDKDGKEVDSSHEKARGNSSLMEKKL 

SRRLCENRRGSLSQEMAKGEEKLAANIXSTP 

SGSSLQRPKKSGDMTLIPEQEPMEIDSEPGVE 

NVFEVSKTQDNRNNNSHQDIDSENMKQKTS 

ATVQKDELRTCTADSKATAPAYKPGRGTGV 

NSNSEKHADHRSTLTKKMHIQSAVSKMNPGE 

KEPIHRGTTEVNIDSBTVHRMLLSAPSENDRV 

QKNLKNTAAEEHVAQGDATLEHSTNLDSSPS 

LSSVTWPLRESYDPDVIPLFDKRTVLBOSTA 

STSPADHSALPNQSLTVRESEVLKTSDSKEGG 

EGFTVDTPAKASITSKRHIPEAHQATLLDGKQ 

GKVIMPLGSKLTGVIVENENITKEGGLVDMA 

KKENDLNAEPNLKQTKATVENOKKDGIAVD 

HWGLNTBKYAETVKLKHKRSPGKVKDISID 

VERRNENSEVDTSAGSOSAPSVLHQRNGQTB 

DVATGPRRAEKTSVATSTEGKDKDVTLSPVK 

AGPATTTSSETRQSEVALPCTSIEADEGLnGT 

HSRNNPLHVGAEASECrVFAAAEEGGAWTE 

GFAESETFLTSTKEGESGECAVAESEDRAADL 

LAVHAVKIEANVNSWTEEKDDAVTSAGSEE 

KCDGSLSRDSEIVEGTITFISEVESDGAVTSAO 

TEIRAGSISSEEVDGSCKjNMMRMOPKKETEG 

TVrCTGAEGRSDNFVICSVTGAGPREERMVT 

GAGWLGDNDAPPGTSASQEGDGSVNDGTE 

GESAVTSTGITEDGEGPASCTGSEDSSEGFAIS 

SESEENGESAMDSTVAKBGTNVPLVAAGPCD 

DEGIVTSTGAKEEDEEGEDWTSTQRGNEIGH 

ASTCTOLGEESEOVLICESAEGDSQIGTWEH 

VEAEAGAAIMNANENNVDSMSGTEKGSKDT 

DICSSAKGIVESSVTSAVSGKDEVTPVPGGCE 

GPMTSAASDQSDSQLEKVEDTTTSTGLVGGS 

YDVLVSGEVPECEVAHTSPSEKEDEDHTSVE 

NEECDGLMATTASGDITNQNSLAQGKNQGK 

VLIlSTSTTNDYTPQVSArrDVEGGLSDALRTE 

ENMEGTRVTTEEFEAPMPSAVSGDDSQLTAS 

RSEEKDECAMKTSIGEEFELPISSAimCAES 

LQPVAAAVEERATGPVLISTADFEGPMPSAPP 

EAESPLASTSKEEKDECALISTSIAEECEASVS 

GVWESENERAGTVMEEKDGSGIISTSSVEDC 

EGPVSSAVPQEBGDPSVTPAEEMGDTAMISTS 

TSEGCEAVMIGAVLQDKDRLTrmVEDLSDA 

AUSTSTAECMPISASIDRHEENQLTADNPEGN 

GDLSATEVSKHKVPMPSLIAENNCRCPGPVR 

GGKEPGPVLAVSTEEGHNGPSVHKPSAGQGH 

PSAVCAEKEEKHQKBCPEIGPFAGRGQKESTL 

HLINAEEKNVLLNSLQKEDKSPETGTAGGSST 

ASYSAGRGLEGNANSPAHLRGPEQTSGQTAK 

DSSVSSIRYLAAVNTGAIKADDMPPVQGTVA 

EHSFLPAEQQGSEDNLKTSTTKCITGQESKIAP 
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PCTAJSOl/03800 



SEQID : 
NO: of ] 
nucl- 1 
eotide 
seq- 
uence 


SEQID 1 
HO: of 1 
peptide 
seq- 
uence 


Vict 1 
tod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspond! 
ng to first 
amino acid 
residue of 
peptide 
sequence _ 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C==Cystein^ 
D=Aspartic Acid, E=<jlutamic Acid, 
F=PhenylaIanine, GMjlycine, H=Histidinc, 
Msoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparaginc, I^Proline, 
Q=Glutamine, R-Ai^inine, S^Serine, 
T^Thrconinc, V^Valinc, W«Tryptophan, 
Y=Tyrosine, X=Unknown, ♦=Stop codon. 
/^possible nucleotide deletion, >r=possible 
nucleotide insertion 














■ SHTNnPPATYSVALLAPKCEQDL'llKNDYSGK 
WTDQASAEKTGDDNSTRKSFPEEGDIMVTVS 
SEENVCDIGNEESPLNVLGOLKLKANLKMEA 
YVPSEEEKNGEELAPPESLCGGKPSGIAELQRE 
PLLVNESLNVENSGFRTNEEIHSESYNKGEISS 
GRKDNAEAISGHSVEADPKEVEEEERHMPKR 
KRKQHYLSSEDEPDDNPDVLDSRIETAQRQC 
PETEPHATKEENSRDLEELPKTSSETNSTTSRV 
MEEKDEYSSSETTGEKPEQNDDDTIKSQE 


1302 


26S2 


A 


10167 


321 


842 


EPSLFPFLRPSPARPPPRFPAPFPSPELACiPlim 

FVFYFFLSYWPKELAKYEYMEEQVBLTEKG 

N^AGRGTSVRaSPSPRPLPPLLPLLADLLE 

DGFGEHPFYHCLVAEVFKEHWTPEGNPSPFP 

EARETKCYVRSSVGCVEPLTTQAEVTENLDR 

KNSOOVFKLLKKK 


1303 


2653 


A 


10171 


206 


429 


NMILLKKRRLLINSLGEGTiNGLLDELLKiNV 
LSQEDTEIVKCENVTVIDKARDLLDSVIRKGA 
RACEICITYI 


1304 


2654 


A 


10184 


970 


1524 


■LCTLSPGISGTAGSCLTTEPGTELGTSFAQNCif 
YHEAWLFTQAUCLNPQDHRLFGNRSFCHER 
LGQPAWALADAQVALTLRPGWPRGLFRLGK 
ALMGLQRFREAAAVFQETLRGGSQPDAAREL 
RSCLLHLTLQGQRGGICAPPLSPGALQPLPHA 
ELAPSGLPSLRCPRSTALRSPGLSPLLH 


130S 


2655 


A 

A 


10194 


2 


394 


TDLLGRRFRVDGAAMAACEGRRSGALCjSSg 

SDFLTPPVGGAPWAVATTWMYPPPPPPPHR 

DHSVTLSFGESYDNSKSWRRRSCWRKWKQL 

SRLQRNMILFLLAFLLFCGLLFYINLADHWKG 

IRNTCT 


1306 


2650 


A 

A 




1 


410 


IPGSTTSLEGPLSKWTNVMKGWQYRWFVLUY 

NAGLLSYYTSKDKMMRGSRRGCVRLRGAVl 

GIDDEDDSTFTITVDQKTFHFQARDADEREK 

WIHALEETILRHTLQLQVRVFTWFPDSSLVGA 

FFFWLVSGFFFK 






A 


10205 


85 


308 


QGLPSTMVKLGCSFSGKPGKDPGDQIXiAAM 
DSVPLISPLDISQLQPPLPDQWIKTQTEYQLS 
SPDOONYTKSR 








10214 


2 


453 


ECGGIRQPGPGPPPALASAPAATMNRVCiiiSi'S 
AAANYLLCTNCRKVLRKDKRIRVSQPLTRGP 
SAHPEKEWQANTVDERTNFLVEEYSTSGRL 
DNITQVMSLHTQYLESFLRSQFYMLRNaXjPL 
PLPYRHYIAIMAAARHQCSYLINM 


1309 


2659 


A 


10233 


45 


421 


RGWPEQQSTGRPRDVARQPRCQKEEGRKUa- 
RALESRTFQGSERSRWGPPLESTKENVQCGH 
RPAFPNSSWLPFHERLQVQNGECPWQVSIQM 
SRKHLCGGSILHWWWVLTAAHCFRRTLLDM 

AV 


1310 


2660 


A 


10241 


243 




" " AFQLFNAKCESAFLSKRNPLQRN WTVLYKKIL 
HKKGQSAEIQKKRlRRAFKFQRArrGASLADI 

MAK 


1311 


2661 


A 


10261 


751 


176 


■ ■ LPGADYGGGHLSLRLFHLLLTSAAWVPDbbi^ 
VTLNSAICVLSTVLIMEFPDLGKHCSEKTCKQ 
LDFLPVKCDACKQDFCKDHFPYAAHKCPFAF 
OKDVHWVCTLCNTPIPVKKGQIPDVVVGDHI 
DRDCDSHPGKIOCEKlFmCSKEGCKKKML 
QMVCAQCHGNFCIQHRHPLDHSCRHGSRPn 

KAG 


1312 


2662 


A 


10270 


3 




" STSSDEGSPSASTPMINKTGFKFSAEKFVlJiVf 
SMTILDKKDGEOAKALFEKVRKPRAHVEDSD 
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PCT/USOl/03800 



SEQID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino add 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A^AJanine C=Cysteinei 
D^Aspartic Acid, EFGlutamic Acid, 
F«=Phenylalanine, G=Glycinei H«Histidine, 
Islcoleudne. K=Lvsine. Lr^jCUcineL 
M^Methionine, N=Aspar9gine, P^Pioline, 
QKjlutamine, R-Aiginine, S-Serine, 
T=Threonine, V=Valine, W=Tiyptophan, 
Y=Tyrosine, X=Unknown, ♦^Stop codon, 
/^possible nucleotide deletion, \Fpossible 
nucleotide insertion 














Lrm.YWQTmTAKFlFlLCYTANFVNAlSF 
EHVCKPKVEHLIGYEWECTHNMAYMLKKL 
LISYISnCVYGFICLYlUnVIJRIPLKEYSFEKV 

KW^OmSEVSENKLREISLNHEWTFEKL 


1313 


2663 


A 


10287 


1221 


266 


■ GAHRVLSPAQGAQPRLRSAASVEVSMVGQR 
VLLLVAFLLSGVLLSEAAKILTISTLOGSHYLL 
LDRVSQILQEHGHNVTMLHQSGKFLIPDIKEE 
EKSYQVIRWFSPEDHQKRIBCKHFDSYIETALD 
GRKESEALVKLMEIFGTQCSYLLSRKDIMDSL 
KNENYDLVFVEAFDFCSFLIAEKLVKPFVAIL 
PTTFGSLDFGLPSPLSYVPVFFSLLTDHMDFW 
GRVKNFLMFFSFSRSQWDMQSTFDKTIKEHF 
PEGSRPVLSHLLLKAELWFVNSDCAFDFARPL 
LPhTTVYIGGLMEKPIKPVPOVSEPSAFSLGFT 


1314 


2664 


A 


10288 


■536 


1890 


"nvqlakfsstlvfffscdadpsalakyvLal 

VKKDKSEKELKALCIDQLDVFLQKETQIFVEK 

LFDAVNTKSYLPPPEQPSSGSLKVEFFPPQEK 

DIKKEEITKEEEREKKFSRWLNHSPPQSSSRYR 

ENRSRDERKKDDRSRKRDYDRNPPRRDSYRD 

RYNRRRGRSRSYSRSRSRSWSKERLRERDRD 

RSRTRSRSRTRSRERDLVKPKYDLDRTDPLEN 

NYTPVSSVPSISSGHYPVPTLSSTITVIAPTHHG 

NNTTESWSEFHEDQVDHNSYVRPPMPKKRC 

RDYDEKGFCMRGDMCPFDHGSDPVWEDVN 

LPGMQPFPAQPPWEGPPPPGLPPPPPILTPPPV 

NLRPPVPPPGPLPPSLPPVTGPPPPLPPLQPSG 

MDAPPNSATSSVFTVVTTGIHHQPPPAPPSLFT 

ADTYDTDGYNPBArSl 1 w l^KrM i KJiK v nrn. 

AKLG 


1313 


266S 


A 


10293 


447 


1331 


SHPLLSCPEKVSAKLRAAAEAAAEERRTROA 
GSRGICAGLRSVAPGPEPLKQEEGRREWdSSI 
GTPSPCGSAQAAAAAAAEEATEKIPALRPALL 

MCVTYHNGTGYCKCPEGFLGEYCQHRDPCE 

KNROQNGOTCVAQAMLOKATCRCASGFTGE 

DCQYSTSHPC3FVSRPCLNGGTCHMLSRDTYE 

CTCQVGFTGRNPKCPGGNLNYQFNGnWYS 

GGSVPPSGTKTSKPAEHNAMGTGSKNFASGT 

LWVMVSGATSTSTSTL 


1316 


2666 


A 


10294 


118 


572 


SLSMESNHKSGDGLSGTQKEAALRALVQKlXi 

YSLVQBNGQRKYGGPPPOWDAAPPERGCEin 

GK1J»RDLFEDELIPLCEKIGKIYEMRMMMDF 

NGNNRGYAFVTFSNKVEAKNADCQLNNYEIR 

NGRLLGVCASVDNCRLFVGGIPKTKK 


1317 


2667 


A 


10301 


158 


1956 


iXKSCQYLLSGVaPCEGKGPTVLVIQTAVPQ 

DRPTKSSMRSAAKPWNPAIRAOGHGPDRVRP 

LPAASSGMKSSKSSTSLAFESRLSRLKRASSE 

rnLNKPGSTAASGWRLKKTATAGAISELTES 

RLMGTOAFITnCRTOIPAPREFSVTVSRERSV 

PRGPSNPMCSVSSPTSSKTPTPTKHLRTPSTKP 

KQENEOGEiCAALESQVRELLAEAKAKDSElN 

RLRSELKKYKEKRTLNAEQTOALGPNVDGTS 

VSPGinWMIRALEEIfl^IKNFQKELSDLEEENR 

VLKEKLIYLEHSPNSEGAASHTGDSSCPTSITQ 

ESSFGSPTGNQLSSDIDEYKKNIHGNALRTSG 

SSSSDVTKASLSPDASDFEHITAETPSRPLSSTS 

NPFKSSKCSTAGSSPNSVSELSLASLTEKIQKM 

EK^STAEELQAn-QELSDQOOMVQELTAE 
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PCTAJSOl/03800 



SEX2ID : 
NO: of 1 
nucl- ] 
eotide : 
seq- 1 
uenoe 


SEQ ID I 
W:of 1 
[)qitide 
»q- 
aence 


Viet 1 5 
lod 1 

i 

1 

1 


5EQ ] 

DNO: 

n 

USSN 
[)9/496 
914 


Predicted 

t>eginnmg 

nucleotide 

location 

coirespondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide ' 
sequence 


Amino acid sequence (A=Alamne 0=Cystcine, 
D=Aspartic Acid, &=Glutaniic Acid, 
F=Phenylalanine, G=Glycbe, H=Histidine, 
I=Isoleucme, K=JLysine, Lr-JLeucme, 
M=Methionine, N«Asparagine, P«=Proline, 
Q=Glutaminc, R=Arginine, S=Scrinc> 
T=Thrconine, V=Valine. W=Tryptophan. 
Y^Tyrosine, X=Unknown, ♦sStop codon, 
/^possible nucleotide deletion, >Fpossible 
nucleotide insertion 














NEKLVDEKTILETSFHQHKERAEQLSQENEKi- 

MNLLQERVKNEEPTTQEGiaiELEQKCTGILE 

QGRFEREKLLNIQQQLTCSLRKVEEENQGAL 

EMIKIOJKEENEKLNEFLELEWiNNNM^ 

EEOIVTLBGLKMENGSLKSHLOG 


1318 


2668 


A 


1O303 


333 


879 


GECFIMAAWQQNDLVFEFASNVMEDERQL 

GDPAIFPAVIVEHVPGADILNSYAGLACVEEP 

M)MrrESSLDVAEEEIIDDDDDDITLTVEASCH 

DGDETCETTEAAEALLNMDSPGPMLDEKRINN 

NFSSPEDDMVVAPVTHVSVTLDGIPEVMETQ 

OVOEKYADSPGASSPEQPKRKKK 


1319 


2669 


A 


10322 




654 


MEVW4SGSVAVTRAIAVPGLLLLJLUATALSL 
LIGAKSLPASWLEAFSGTCQSADCnVLDAR 
LPRTLAGLLAGGAU5LAGALMQTLTRNPLAD 
PGLLGVNAGASFATVLGAALFGYSSAQEQLA 
MAFAGALVASLIVAFTGSQGGGQLSPVRLTL . 

AGVXL 


1320 


2670 


A 


10323 


441 


2 


KMNQVAWIGGGQTLGAFLCHGLAAEGYRV 

AWDIQSDKAANVAQEINAEYGEiSMAYGFG 

ADATSEQSVLALSRGVDEIFGRVDLLVYSAGI 

AKAAHSDFQLGDFDRSLQVNLVGYFLCARE 

FSRLMIRDGIOGRnQINSKSDE 


1321 


2671 


A 


10332 


1 


453 


RHRTAGPGSTISSRTDSASAPAARAMPCEYIY 

AKLTSDCSRPSLQWYTRAQSKMRRPRLLLKD 

lUCCTIiVFGVRJLYILKLNYTrEECDMKNMH 

YVDPDHVKRAQKYAQQVLQKESPPKFAKTS 

MALLFEHRYSVDLLPFVQKAPTDSEA 


1322 


2672 


A 


10333 


25 


423 


EPSNOPWYSALGNEDDEILLLGKDlIG'll'AAS 
ERKMRAHQVLTFLLLFVrrSGASENASTSRGC 
GLDLLPQNVYLCDLDAIWGIWEAVAGAGA 
LlTUXNIUIXGRlJTIKEKESaCSPAVLHFLFL 

LGTLG 


1323 


2673 


A 


10334 


52 


426 


SSLGNEDDElLSLAKDITGMFVASHRiMRAhl 
QVLTFLLLFVrrSVASENASTSRGCGLDLLPQ 
YVSLCDLDAIWGIWEAAAGAGALITLLLMLI 
LLVRLPFFKEKEKKSPVGLHFLFLLGTLOP 


1324 


2674 


A 


10336 


1 


932 


"ERLCFPCMQSKIYSYMSPNKCSGMRFPLgbb 

NSVTHHEVKCQGKPLAGIYRKREEKRNAGN 

AVRSAMKSEEQKIKDARKGPLVPFPNQKSEA 

AEPPKTPPSSCDSTNAAIAKQALKKPIKGKQA 

PRKKAQGKTQQNRKLTDFYPVRRSSRKSKAE 

LQSEERKRIDELIESGKEEGMKIDLIDGKGRG 

VIATKQFSRGDFVVEYHGDLIEITOAKKREAL 

YAQDPSTGCYMYYFQYLSKTYCVDATRETN 

RLGRLINHSKCONC(?rKLnUlLMJvrm-u-iAo 

RDIAAGEELLYDYGDRSKASIEAHPWLKH 


1325 


2675 


A 


10338 


3 


870 


" "PGSTISCSELKGTQCRAl AUSRGRRPPMi iJWL 
RGVTATFGRPAEWPGYLSHLCGRSAAMDLG 
PMRKSYRGDREAFEETHLTSLDPVKQFAAWF 
EEAVQCPDIGEANAMCLATCTRDGKPSARML 
LLKGFGKDGFRFFTNFESRKGKELDSNPFASL 
VFYWEPLNRQVRVEGPVKKLPEEEAECYFHS 
RPKSSQIGAWSHQSSVIPDREYLRKKNEELE 
QLYQDQEVPKPKSWGGYVLYPQVMEFWQG 
QTNRLHDRIVFRRGLPTGDSPLGPMTHRGEE 

DWLYERLAP 


1326 


2676 


A 


10344 


2 


'984 


■ ARAAAHCGICRLVRWWRKRRSVMGigrSPV 
LLASLGVGLVmGLAVGSYLVRRSRRPQVT 
LLDPNEKYIXRIXDKTTVSHNTKRFRFALPTA 
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SEQID 
NO: of 
nucl- 
eotide 
seq- 
uence 



SEQID 
NO: of 
peptide 
seq- 
uence 



Met 
hod 



SEQ 
ID NO: 
in 
USSN 
09/496 
914 



Predicted . 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 



Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 



Amino acid sequence (A«Alanine OCystemc. 
D=Aspartic Acid, E=Glutaniic Acid, 
F=Phenylalanine, G<jlycine, H=Histidine, 
l=lsoleucine, K=Lysine, L=4^cine, 
M=Methionine, N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T«Threonine, V=Valine, W=Tiyptophan, 
Y«Tyrosine, X=Unknown, ♦^Stop codon, 
/=possible nucleotide deletion, \Fpossible 
nucleotide insertion 



HHTLGLPVGKrni^imOGSLVIRPYTPVrsu 



EDQGYVDLYIKVYLKGVHPKFPEGGKMSQy 
LDSLKVGDWEFROPSOaTYTOKOHFNIQP 
NKKSPPEPRVAKKLGMIAGGTGrrPMLQLIRA 
ILKVPEDPTQCFLLFANQTEKDULREDLEELQ 
ARYPNRFKLWFTLDHPPKDWAYSKGFVTAD 
MIREHLPAPGDDVLVLLOOPPPMVQLACHPN 
LDKLGYSQKMRFTY 



1327 



2677 



"A I 10345 



968 



1328 



"2678 I 10346 



1329 126'^ 



173 1 439 

9sr 



1330 



2680 



rS I 10352 



34 



, 2573 



LQSAGEGVTHVLiLLESPARPVAAVrgvyKK 
RYHRLSDMSMLAERRRKQKWAVDPQNTAW 
SNDDSKFGQRMLEKMGWSKGKGLGAQEQG 
ATOHIKVQVH^WHLGLGATINNEDNWIAHQ 
DDFNQLLAELNTCHGQETTDSSDKKEKKSFS 
LEEKSKISKmVHYMKFTKGiO^LSSR^TDL 
DCIFGKRQSKKTPEGDASPSTPEENETTTTSAF 
TIQEYFAKRMAALKNKPQVPVPGSDISETQVE 
RKRGKKRNKEATGKDVESYLQPKAKRHTEG 
KPERAEAQERVAKKKSAPAEEQLRGPCWDQ 

SSKASAQDAGDHVQPA 



GSAAMKVKIKCWNOVATWLWVANDENUUi 
CRMAFNGCCPDCKVPGDDCPLVWGQCSHCF 
HIVfHCILKWLHAQQVQQHCPMCRQEWKFKE 



QMEPGNDTQISEFLLLGFSQKPGLQPFU^ULM. 
SMYLVTVLGNLLIILATTSDSHLHTPMYFFLSN 
LSFADICVTSTTIPKMLMNIQTQNKVrryiACL 
MQMYFFILFAGFENFLLSVMAYDRFVAICHP 
LHY\rmiNPHLCGLLVLASWTMSALYSLLQI 
LMVVRLSFCTALEIPHFFCELNQVXQLACSDSF 
LNHMVIYFTVALLGGGPLTGILYSYSKIISSIH 
AISSAQGKYKAFSTCASHLSWSLFYGAILGV 
YLSSAATRNSHSSATASVMYTVVTPMLNPFI 
YSLR NKDIKRALQIHLLWGTMKGQFFKKCP 
IPFLKSCCCCXXFDFPPPPLDQVQEEBCEVmV 
TEHGTPKPFRKFDSVAFGESQSEDEQFENDLE 
TDPPNWQQLVSREVLLGLKPCEIKRQEVINEL 
FYTERAHVRTLKVLDQVFYQRVSREGE^PSE 
LRKIFSNLEDILQLfflGLNEQNQCAVRKRNETS 
VIDQIGEDLLTWFSGPGEEKLKHAAATFCSNQ 
PFALEMKSRQKKDSRFQTFVQDAESNPLCRR 
LOLKDnPTQMQRLTKYPLLLDNIATYTBWPT 
EREKVKKAADHCRQILNYVNQAVKEAENKQ 
RLEDYQRRLDTSSLKLSEYPNVEEURNLDLTK 
RKMIHEGPLVWKVNRDKTIDLYTLLLEDILV 
LLQKQDDRLVLRCHSKILASTADSKHirSPVI 
KI^STVLVRQVATDNKALFVISMSDNGAQIYE 
LVAQTVSEKTVWQDUCRMAASVKEQSTKPI 
PLPQSTPGEGDNDEEDPSKLKEEQHGISVTGL 
OSPDRDLGLESTLISSKPQSHSLSTSOKSEVRD 
LFVAERQFAKEQHTDGTLKEVGEDYQIAIPDS 
HLPVSEERWALDALRNLGLLKQLLVQQLGLT 
EKSVQEDWQHFPRYRTASQGPQTDSVIQNSE 
NIKAYHSGEGHMPFRTGTGDIATCYSPRTSTE 
SFAPRDSVGLAPQDSQASNILVMDHMIMTPE 
MPTMEPEGGLDDSGEHFFDAREAHSDENPSaS 
GDGAVNKEEKDVNLRISGNYLILDGYDPVQE 
SSTDEEVASSLTLQPMTGIPAVES'mQQQHSP 
ONTHSDGAISPFTPEFLVQQRWGAMEYSCFEI 
QSPSSCADSQSQIMEYIHKIEADLEHLKKVEE 
SYmCQRLAGSALTDKHSDKS 
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SEQ ID 
NO: of 
nucl- 
eotide 
se<^ 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USSN 
09/496 
914 


Predicted 

beginning 

nucleotide 

location 

conespondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end .\ 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine 0=Cysteine, 
D=Aspaitic Acid, ENjlutamic Acid, 
F=Phenylalanine, G=<jlycinc, H=Histidine, 
I=Isoleucine, K=Lysine, L=Leucine, 
M=Methioninc,N=Asparagine, P^Proline, 
Q=Glutamine, R«=Arginine, S=Serinc, 
T=Thrconinc, V^Valinc, W^Tryptophan, 
Y=Tyrosine, X=Unknown, ♦^Stop codon, 
/=possible nucleotide deletion, V=possible 
nucleotide insertion 


1331 




A 


10353 


1 


2100 


AVEFAEGALTMAPWPELGDAQPNPDKYLEU 

AAGQQPTAPDKSKETNKTDKIWVTKIELLP 

SYSTATLIDEPTENOJDPWNLPTLQDSGIKWSE 

RDTKGKILCFFQGIGRLILLLGFLYFFVCSLDIL 

SSAFQLVGGKMAGQFFSNSSIMSNPLLGLVIG 

VLVTN^VQSSSTSTSIWSMVSSSLLTVRAAIP 

nMGANIGTSITNnVALMQVGDRSEFRRAFA 

GATVHDFFNWLSVLVLLPVEVATOYLEIITQL 

rVESFHFKNGEDAPDLLKVITKPFTKUVQLDK 

KVISQIAMNDEKAKNKSLVKIWCKTFTNKTQ 

INVTWSTANCTSPSLCWTIXJIQNWTMKNW 

YKJENIAKCQHIFVNFHLPDLAVGTILLILSLLV 

LCGCLIMIN^ILGSVLKGQVATmKTINTDFP 

FPFAWLTGYLAILVGAGMTFIVQSSSVFTSAL 

TPUGIGVrriEIUYPLTLGSNIGTnrTAILAAL 

ASPGNALRSSLQlALCHFFFNlSGILLWYPIPFr 

RLPIRMAKGLGNISAKYRWFAVFYLUFFFLIP 

LTVFGLSLAGWRVLVGVGVPWFniLVLCLR 

LLQSRCPRVLPKKLQNWNFLPLWMRSLKPW 

DAWSKFTGCFQMRCCCCCRVCCRACCLLC 

GCPKCCRCSKCCEDLEEAQEGQDVPVKAPET 

FDNinSREAQGEVPASDSKTECTAL 


1332 

• 


26S2 


A 


10354 


30 


1377 


"SQQGSQPHRQGPPSLLTAPHSLDLPALPPGPR 
GSQGKLRRVLVPMSVKPSWGPGPSEGVTAVP 
TSDLGEIHNWTELLDLFNHTLSECHVELSQST 
KRWLFALYLAMFWGLVENLLVICVNWRG 
SGRAGLMNLYILNMAIADLGIVLSLPVWMLE 
VTLDYTWLWGSFSCRFTHYFYFVNMYSSIFF 
LVCLSVDRYVTLTSASPSWQRYQHRVRRAM 
CAGIWVLSAIIPLPEWHIQLVECffEPMeLFM 
APFKITSWALAVALSrnLGFLLPFPLirVFN 
VLTACRLRQPOQPKSRRHCLLLCAYVAVFV 
MCWLPYHVTLLLLTLHGTHISLHCHLVHLLY 
FFYDVIDCFSMLHCVINPILYNFLSPHFRGRLL 
NAWHYLPKDQTKAOTCASSSSCSTQHSnrr 
KGDSQPAAAAPHPEPSLSFQAHHLLPNTSPISP 
TQPLTPS 


1333 


2683 


A 


10358 


2 


884 


^GAGADGREPASERASRAEPPAVAMGQNU 
LMGTAEDFADQFLRVTKQYLPHVARLOJST 
FLEDGIRMWFQWSEQRDYIDTTWNCGYLLA 
SSFVFLNLLGQLTGCVLVLSRNFVQYACFGLF 
GIlALQTlAYSILWDLmMBNLALGGGLUX 

laesrseoksmfagvp™resspkqymqlgg 

RVLLVLMFMTLLHFDASFFSIVQNIVOTAlJvn 

lvajgfktklaaltlwwlfainvyfnafwt 
ipvykpmhdflkydffqtmsvigglllwal 
gpggvsmdekkkew 


1334 


2684 


A 


10367 


59 


1562 


' QAWSLQVALSPFFFPASPSNSFAAAVFQLLTl' 
ELPLPHVPGQESAKRRSARRFLIMSELTECH-M 

elvwgtksspglsdtffcrwtqgfvfseseqs 

aleqfeggpcaaoapvqafllkkllfssekss 

wrdcsqeeqkellchtlcdilesaccdhsgs 

yclvswlrgktteetasisgspaesscxjvehs 

salaveelgferfhaliqkrsfrslpelkdav 

ldqysmwgnkfgvllflysvlltkgienikn 

eiedaseplidpvyghgsqsunllltghavsn 

vwixjdrecsgmkixgiheqaavgfltlmea 

lryckvgsylkiskipyldclasethltvffa 

kdmalvapeapseqarrvfqtydpedngfip 

dslxedvmkaujlvsdpeyinudknk^ 
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SEQID 5 
NO: of 1 
nucl- ] 
cotide 2 
seq- 1 
uence 


JEQID f 
^0:of \ 
peptide 
wq- 
jeace 


lod ] 


SEQ 1 
[DNO: 1 
n 1 
USSN 
09/496 
914 


'redicted 1 

beginning 

nucleotide 

ocation 

wrrcspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 
lucieouuc 
ocation 
corresponding 
to last ambo 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine CM^ysteme, 
[)=Aspartic Acid, E^Glutaniic Acid, 
F^Phcnylaianine, G^Glycine, H=Hfatidine, 
Msolcucine, K=Lysine, lM«eucine, 
M=Methionine. N=Aspaiagine, P=Proline, 
Q=Glutaminc, R^Arginine, &«Scrine, 
T=Threonine, V=:Valine, W-Tiyptophan, 
Y=Tyrosine, X=Unlaiowii, ♦^Stop codon, 
/^possible nucleotide deletion, V=possible 
nucleotide insertion 














IGULLGPFLQEITPDQQSSGPESFTVYHYNGL 
KQSlWTEKVNm^EGTAN^GFEDPMLQTD 
DTPDCRCLOTKWPyiELLWTrDRSPSLN 


1335 


2685 


A 


10375 


82 


2929 


totkriu.grekamaspprgwgcoelllpl'ml 

lgtlcepgsgqirysmpeeidkgsfvgniakd 

lglepqelaergvrivsrgrtqlfalwprsgs 

lvtagridreelx:aqsplcvvnfnilvenkm 

kiygveveiidindnfprfrdbelkvkvnena 

aagtrlvlpfardadvgvnslrsyqlssnlh 

fsldwsgtdgqkypelvleqpldreketvh 

DLLLTALDGGDPVLSGTTHIRVTVLDANDNA 

PLFTPSEYSVSVPENIPVGTRLLMLTATDPDE 

GrNGKLTYSFRNEEEKISETFQLDSNLGEISTL 

QSLDYEESRFYLMEWAQDGGALVASAKW 

VTVQDVNDNAPEVILTSLTSSISEDCLPGTVIA 

LFSVHIXjDSGENGEIACSIPRNLPFKLEKSVD 

^^^imLTTRDLDREETSDYNlTLTVMDHGTP 

PLSTESHIPLKVADVNDNPFNFPQASYSTSVT 

ENNPRGVSIFSVTAHDPDSGDNARVTYSLAE 

EOTQGAPLSSYVSINSDTGVLYALRSFDYEQL 

RDLQLWVTASDSGNPPLSSNVSLSLFVLDQN 

DNTPEELYPALPTDOSTGVELAPRSAEPGYLV 

mj-«r«r ATrrNVrkC/^rVKlAWI QYHT I If A55EPGLFA 

VGLHTOEVRTARALLDRDALKQSLWAVED 

HGQPPLSATFTVTVAVADRIPDILADLGSIKTP 

IDPEDLDLTLYLWAVAAVSCVFLAFVIVLLV 

LRLRRWHKSRLLQAEGSRLAGVPASHFVGV 

DGVRAFLQTYSHEVSLTADSRKSHLIFPQPNY 

AT^TT T QPCQrpvWPT I MSDKVDANKEERRV 

QQAPPNTDWRFSQAQRPGTSGSQNGDDTGT 

wpnnqferremlqamilasaseaadgsstlgg 

gagtmglsarygpqftlqhvlqgelgsdyr 

qnvyipgsnailtnaagkrdgkapaggngn 

KKKSGKKEKK 


1336 


2686 


A 


10379 


1 


557 


• RPRRRQPSFSCRVLVLEDPPCFRFTNSMMVJiiN 
LAKLQAQVRIGGKGTARRKKKWHRTATAD 
DKKXQSSLKIOAVNNIAGIEEVNMIKDDGTVI 
HFNNPKVQASLSANTFAITGHAEAKPITEMLP 
GILSQLGADSLTSLRKLAEQFPRQVLDSKAPK 
PEDIDEEDDDVPDLVENFDEASKNEAN 


1337 


2687 


A 


10380 


1 

• 


1263 


■ ■ ipgstiswspaaarglsvcrccrlhpasamul 
fgdlpepersprpaagkeaqkgpllfddlppa 

<5<5TnSGSGGPLLFDDLPPASSGDSGSLATSISQ 

mvktojkgakrktseeekngseelvekkvc 

kassvifglkgyvaerkgereemqdahviln 

diteecrppsslnrvsyfavfdghggiraskf 

aaqnlhqnlirkfpkgdvisvektvkrclld 

tfkhtdeefikqassqkpawkdgstatcvla 

vdne-yianlgdsrailcryneesqkhaalsl 

SKEHNPTQYEERMRIQKAijuiS vkuuk v laj v 
LEVSRSIGDGQYKRCOVTSVPDIRRCQLTPND 
RFUXACDQLFKVFTPEEAVNFILSaEDEKIQ 
TREGKSAADARYEAACNRLANKAVQRGSAD 

>IYTYMVVlllfiH 


1338 


2688 


A 


10385 


3 


■ 589 


- GPSQSMAAGELEGGKPLSGLLNALAQDirHU 

ypgiteellrsqlypevppeefrpflakmrgil 

KSlASADMDFNQLEAFLTAQTKKQOGrrSDQ 

aaviskfwkshktkireslmnqsrwnsglrg 
lswrvdgksqsrhsaqihtpvaiielelgkyg 
oeseflclefdevkvnqilktlseveesistlis 
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SEQ ID 
NO: of 
nucl- 
eotide 
seq- 
uence 


SEQ ID 
NO: of 
peptide 
seq- 
uence 


Met 
hod 


SEQ 
ID NO: 
in 

USbN 

09/496 
914 


Predicted 

beginning 

nucleotide 

location 

correspondi 

ng to first 

amino acid 

residue of 

peptide 

sequence 


Predicted end 

nucleotide 

location 

mrrrsnnndi nP 

to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A»Aianine OCystemei 
D-Aspartic Acid, E=GIutamic Acid, 
F«PheDy]alaninei| Q=Q\ycmt, H^Histidine^ 
I-Isol6ucine» KfLysbe, IMxucme, 
M=Melhionine» N=»Asparagine» P=PiDline^ 
Q^lutamine, R^Aiginine, SpSerine, 
T='Threonine, Vi=VaIine, W=Tiyptophan, 
Y=Tyrosine, X^Unknown, ♦«Stop codon, 
/^possible nucleotide deletion, V=^ssiblc 
nucleotide insertion 
QPN 


1339 


2689 


A 


10386 


50 


390 


lgamakhhpdlifcrkqAgVaigrlcekcdg 
kcvicdsyvrpctlvricdecnygsyqgrcvi 
cggpgvsdayyckecnqekdrdgcpkivnl 
gssktdlfyekkkygfkkr 


1340 


2690 


A 


10388 


113 


3472 


SQLRKGASATHSSPSRTDCIAQMMDIYVCLK 

RPSWMVDNKRMRTASNFQWLLSTFILLYLM 

NQVNSQKKGAPHDLKCVTNNLQVWNCSWK 

APSGTGRGTDYEVCIENRSRSCYQLEKTSIKIP 

ALSHGDYBITINSLHDFGSSTSKFTLNEQNVSL 

IPDTPEELNLSADFSTSTLYiKWNDRGSVFPHR 

SNNOWEIKVLRKESMELVKLVTHNTTLNGKD 

TLHHWSWASDMPLECAIHFVEIRCYIDNLHFS 

GLEEWSDWSPVKNISWDSQTTCVFPQDKVIL 

VGSDrrFCCVSQEKVLSALlOHTOCPLIHUXJE 

NVAIKJRNISVSASSGTNVVFTTEDNIFGTVIF 

AGYPPDTPQQLNCETHDLKEnCSWNPGRVTA 

LVGPRATSYTLViSFSGKYVRLXRAEAPTNES 

YQLLFQMLPNQEIYNFTLNAHNPLGRSQSTIL 

VNnEKVYPHTPTSFKVKDINSTAVKl^WHLP 

GNFAKINF1X3EIEIKKSNSVQEQRNVTIKGVE 

NSSYLVALDKLNPYTLYTFRIRCSTETFWKW 

SKWSNKKQHLTTEASPSKGPDTWREWSSDG 

KNLIIYWKPLPINEANGKILSYNVSCSSDEETQ 

SLSEIPDPQHKAEIRLDKNDYnSWAKNSVGS 

SPPSKIASMEIPNDDLKIEQVVGMGKGILLTW 

HYDPNMTCDYVIKWCNSSRSEPCLMDWRKV 

PSNSTETVIESDBFRPGIRYNFFLYGCRNQGY 

QLLRSMIGYIEELAPIVAPNFTVEDTSADSILV 

KWEDIPVEELRwrLWj X Lr i rvj^urjvu i oxsjv* 

RVLESGRSDIKVKNITDISQKTLRIADLQGKTS 

YHLVLRAYTDGGVGPEKSMYVVTKENSVGL 

IIAILIPVAVAVIVGVVTSILCVIIKREWIKETFY 

PDIPNPENCKALQFQKSVCEGSSALKTLEMNP 

CIPNNVBVLETRSAFPKBBDTEIVSPVAERPEN 

RSDAKPENHWESYCPPnEEBIPNPAADETGG 

TAQVIYIDVQSMYQPQAKPEEEQENDPVGGA 

GYKPQMHLPINSTVEDIAAEEDLDKTAGYRP 

QANVNTWNLVSPDSPRSIDSNSEIVSFGSPCSI 

NSRQFLIPPKDEDSPKSNOOOWSFTNFFQNKP 

ND 


1341 


2691 


A 


10392 . 


I 


5057 


MUTKHLSATKPKKSWAPNLYELDSDL'lKlil' 

DVnGEGPTDSEFFHQRFRNLIYVEFVGPRKTL 

KUO^CLDWLQPETRTKEEIIELLVLEQYLTII 

PEKLKPWVRAKKPENCEKLVTLLENYKEMY 

QPEGESLHGVLWSAGLRCPLGLSASTLLTW 

SGLDNSLSWAAVGMSCVLWDIELHHDFLGV 

ATKSVSTHAQGDAAQGLGGTIVRMWARDSN 

LATGVLLDDNNSDVTSDDDMTRNRRESSPPH 

SVHSFSGDRDWDRRGRSRDTEPRDRWSHTR 

NPRSRMPPRDLSLPWAKTSFEMDREDDRDS 

RAYESRSQDABSYQNWDLAEDRKPHNTIQD 

KMErratHXSUJVQl^DDGHSHMTQGHSS 

RSKRSAYPSTSRGLKTMPEAKKSTHRRGICED 

ESSHGVIMEKFIKDVSRSSKSGRARESSDRSQ 

RFPRMSDDNWKDISLNKRESVIQQRVYEGNA 

FRGOFRFNSTLVSRKRVLERKRRYHFDTDGK 

GSIHDQKGCPRKKPFECGSEMRKAMSVSSLS 

SLSSPSFTESQPIDFGAMPYVCDECGRSFSVIS 

EFVEHQIMHTRENLYBYGESFIHSVAVSEVQK 
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"SEQ n> 1 s 
MO: of h 
nuci- I 
eotide i 
seq- 1 y 
uence 


NIO: of \ 
>eptide 
»eq- 
iience 


lod I 

i 
\ 
( 
< 


JEQ ] 
DNO: 1 
n 1 
JSSN 
39/496 
?14 


Predicted | 1 
segiiming i 
lucieotide 
ocation 
conespondi 
ng to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
lucleotide 
ocation 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A«Alanine C>=Cysteine. 
[>=Aspartic Acid, EF=Glutamic Acid, 
F=Phenylalanine, GKJIycine, H=Histidine, 
Nlsoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=ProIine, 
Q=Glutamine, R=«Arginine, S==Scrine, 
T^Threoninc. V^Valine, W^Tiyptophan. 
y^yrosine, X^Unknown, ♦=Stop codon, 
/=possible nucleotide deletion, V=^>ossible 
nucleotide insertion 














SQVGGKRFECKDCGETFNKSAALAEHRKIHA 

RGYLVECKNQECEEAFMPSPTFSELQKIYGK 

DKFYECRVCKEiaHSSALIEHQKIHFGDDKD 

NEREHERERERERGETFRPSPALNEFQKMYG 

KEKMYECKVCGETFLHSSSLKBHQKIHTRGN 

PFENKGKVCEETFIPGQSLKRRQKTyNKEKLC 

DFTDGRDAFMQSSELSEHQKIHSRKNLFEGR 

GYEKSVIHSGPFTESQKSHTITRPLESDEDEKA 

FTTSSNPYENQKIPTKENVYEAKSYERSVIHSL 

ASVEAQKSHSVAGPSKPKVMAESTIQSFDAIN 

HQRVRAGGNTSEGREYSRSVIHSLVASKPPRS 

HNGNELVESNEKGESSmSDLNDKRQKIPAR 

ENPCEGGSKNKNYEDSVIQSVFRAKPQKSVP 

GEGSGEFKKDGEFSVPSSNVREYQKARAKKK 

YIEHRSNETSVmSLPFGEQTFRPRGMLYECQ 

ECGECFAHSSDLTEHQKIHDREKPSGSRNYE 

WSVmSLAPTDPQTSYAQEQYAKEQARNKCK 

DFRQFFATSEDLMTNQKJYDQEKSHGEESQGE 

NTDGEETHSEETHGQETIEDPVIQGSDlvlEDPQ 

KDDPDDKIYECEDCGLGFVDLTDLTDHQKVH 

SRKCLVDSREYTHSVIHTHSISEYQRDYTGEQ 

LYECPKCGESFIHSSFLFEHQRIHEQDQLYSM 

KGCDDGFIALLPMKFRRNRAAERNPALAGSA 

IRCLLCGQGFIHSSALNEHMRLHREDDLLEQS 

OMAEEAIIPGLALTEFQRSQTEERLFECAVCX} 

ESFVNPAELADHVTVHKNEPYEYGSSYTHrS 

FLTEPLKGAIPFYECKDCXiKSFIHSTVLTKHKE 

LHLEEEEEDEAAAAAAAAAQEVEANVHVPQ 

WLRIQGLNVEAAEPEVEAAEPEVEAAEPEV 

EAAEPNGEAEGPDGEAAEPIGEAGQPNGEAB 

QPNGDADEPDGAGIEDPEERAEEPEGKAEEPE 

GDADEPDGVGIEDPEEGEDQEIQVEEPYYDC 

HECTETFTSSTAFSEHLKTHASMIIFEPANAFG 

ECSGYIERASTSTGGANQADEKYFKCDVCGQ 

LFNDHLSLARHQNTHTG 


1342 


2692 


A 


10393 


2 


1350 


"GRPRSSSDNRNFLRERAGLSSAAVQTR1UN5A 
ASRRSPAARPPVPAPPALPRGRPGTEGSTSLS 
APAVLWAVAVWVVVSAVAWAMANYIHV 
PPGSPEVPKLNVTVQDQEEEIRCREGALSLLQ 
HLRPHWDPQEVTLQLFTDGrmKUGCYVGN 
TMEDWLVRIYGNKIELLVDRDEEVKSFRVL 
OAHGCAPQLYCTFNNGLCYEnQGEALDPKH 
VChn>AIFRLIARQU^KIHAIHAHNGWIPKSNL 
WLKMGKYFSUPTGFADEDINKRFLSDIPSSQI 
LOEEMTWMKEILSNLGSPVVLCHNDLLCXNn 
YNEKQGDVQFIDYEYSGYNYLAYDIQNHFNE 
FAGVSDVDYSLirUKJbLrV^oy wi^RA i j^on. * 

efkgfgtevtekeveilfiqvnqfalashffw 
glwaliqakystiefdflgyaivrfnqyfkm 

kprvtalkvpe 


1343 


2693 


A 


10394 


102 


839 


- T»EAQTS AVlJ\REKGHIJnl4RHEAPMQMASA 
ODARYGQKDSSDQNFDYMFICLmGNSSVGK 
ireFLFRYADDSFTSAFVSTVOIDFKVKTVFKN 
EKRIKLQIWDTAGQERYRTrrrAYYRGAMGFI 
LNfYDITNEESFNAVQDWSTQIKTYSWDNAQ 
VILVGNKCDMEDERVISTERGQHLGEQLGFE 
FFETSAKDNINVKQTFERLVDnCDKMSESLET 
DPAirAAKONTRLKETPPPPOPNCAC 


1344 


2694 


A 


10395 


2 


4136 


" DRPPWNSRVDDFVTNLIHLSSKGHISPAJaiTS 
LQQRTPAEMSPVLHFYVRPSOHEOAASGHrR 
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DUCl- { 
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Predicted 1 
}eginiimg i 
ludeotide 
ocatioa 
correspondi 
ng to first 
amino acid 
residue of 
peptide 
sequence 


=»rcdicted end 
lucleotide 
ocation 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A«Alaninc C=Cysteine, 
3=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine. G=Glycine, H=Histidine, 
I=Isolcucine, K=Lysine. L=Uucine, 
M=^ethionine, N^Asparagine, P=Proline, 
Q=Glutamine, R«Arginine, S=Scrine, 
T-Thnsonine, V«Valinc, W^Tryptophan, 
Y=^yrosine, X=Unknown, *=6top codon, 
/^possible nucleotide deletion, V=possible 
nucleotide insertion 














RKLQGKLPELQGVb-lELCYNVNWTAEALFtiA 

EETKiaMWLFGCPLLLDDVARESWLLPQSN 

DLLLEVOPKLNFSTTTSTNIVSVCRATGLGPV 

DRVETTRRYRLSFAHPPSAEVEAIALATXHDR 

MTEQHFPHPIQSFSPESMPEPLNGPINfl.GEGR 

LALEKANQELGIJaDSWDLDFYTKRFQELQR 

NPSTVEAFDLAQSNSEHSRHWFFKGQLHVDG 

QKLVHSLFESIMSTQESSNPNNVLKFCDNSSA 

IQGKEVRFLRPEDPTRPSRFQQQQGLRHWFT 

AETHNFPTGVCPFSGATTGTGGRIRDVQCTG 

RGAHWAGTAGYCFGNLHIPGYNLPWEDLSF 

QYPGNFARPLEVAIEASNGASDYGNKFGEPV 

LAGFARSLGLQLPDGQRREWIKPIMFSGGIGS 

MEADHISKEAPEPGMEWKVGGPVYRIGVGG 

GAASSVQVQGDNTSDLDFGAVQRGDPEMEQ 

KMNRVIRACVEAPKGNPICSLHDQGAGGNG 

NVLKELSDPAGAnYTSRFQLGDPTLNALEIW 

GAEYQESNALLLRSPNRDFLTHVSARERCPA 

CFVGTITGDRRIVLVDDRECPVRRNGQGDAP 

PTPPPTPVDLEJLEWVLGKMPRKEFFLQRKPP 

MLQPLALPPGLSVHQALERVLRLPAVASKRY 

LTNKVDRSVGGLVAQQQCVGPLQTPLADVA 

WALSHEEUGAATALGEQPVKSLLDPKVAA 

RLAVAEALTNLVFALVTDLRDVKCSGNWM 

WAAKLPGEGAALADACEAMVAVMAALGVA 

VDGGKDSLSMAARVGTETVRAPGSLVISAYA 

VCPDITATVTPDLKHPEGRGHLLYVALSPGQ 

HRLGGTALAQCFSQLGEHPPDLDLPENLVRA 

FSITQGLLKDRLLCSGHDVSDGGLVTCLLEM 

AFAGNCGLQVDVPVPRVDVLSVLFAEEPGLV 

LEVOEPDLAQVLKRYRDAGLHCLELGHTGE 

AGPHAMVRVSVNGAWLEEPVGELRALWEE 

TSFQLDRLQAEPRCVAEEEROLRERMOPSYC 

LPPTFPKASVPREPGGPSPRVAILREBOSNGDR 

EMADAFHLAGFEVWDVTMQDLCSOAIGLDT 

FRGVAFVGGFSYADVLGSAKGWAAAVTFHP 

RAGAELRRFRKRPDTFSLGVCNGCQLLALLG 

WVGGDPNEDAAEMGPDSQPARPGLLLRHNL 

<inRYFSRWASVRVGPGPALMLRGMEOAVLP 

VWSAHGEGYVAFSSPELQAQIEARCaAPLHW 

ADDDGNPTEQYPLNPNGSPGGVAGlCSCaDGR 

HLAVMPHPERAVRPWQWAWRPPPFDTLTTS 

PWLQLFINARNWTLEGSC 


1345 


2695 


A 


10396 


65 


642 


■ GVRGFWAGTMASRAGPRAAOTDGSUfiiHKls 
RVAMHYQMSVTLKYEKKLIYVHLVIWLLLV 
AKMSVGHLRLLSHDQVAMPYQWEYFVLLSI 
LPSLLGLLSFPRNNlo I L V LroMiamvjuroirLr A-* 
YGSMEMFPAAQQLYRHGKAYRFLFGFSAVSI 
MYLVLVLAVQVHAWQLYYSKKLLDSWFTST 

QEKKHK 

- ■ DDFVRCGPQSAAMGASARLLRAVIMljAPGS 


1346 


2696 




" 10398 




""tTS 


GKGTVSSRITTHFELKHLSSGDLLRDNMLRGT 

EIGVLAKAFIDQGKLIPDDVMrRlj\LHELKNL 

TOYSVaLIXJFPRTIJKJAEALDRAYQIDTVINL 

NVPFEVnCQRLTARWIHPASGRVYNIEFNPPK 

TVGIDDLTGEPLIQREDDKPETVKRLKAYED 

QTKP^^-EYYQKKGVI^SGTETNKIWInr^ 

AFLOTKVPORSOKASVTP 


1347 


2697 


A 


10402 


153 


1969 


- - KHRQENNAU^MAPEIHMTUPMCLIEN 1 NOa, 
VANPEALKILSAITQPWWAIVGLYRTGKSY 
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< 

1 

1 


5EQID 1 S 
HO:aS 1 
aucl- 1 
sotide 
seq- 1 
uenoe 1 


5EQID I 
^0: of 1 
peptide 
seq- 
uence 


Viet 1 ! 
lod 1 


5EQ 

[DKO: 

m 

USSN 
09/496 
914 


Predicted 
beginning 
nucleotide 
location 
correspondi 
ng to first 
amino acid 
residue of 
peptide 
sequence _ 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A«Alamne C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, CHGlycinc, H=*[istidine. 
Msoleucine, K=Lysine, L=Leucinc, 
M^Mcthionine, N=Asparagine, P=Proline, 
Q=Glutaniine, R«Arginine, S=Serine, 
T=Thieonine, V«Valine, W=Tryptophan, 
Y«Tyrosine, X=Unknown, ♦"Stop codon, 
/=possible nucleotide deletion, NF^JOSslble 
nucleotide insertion 
















LMNKLAC^KGFSLGSTVKSHTKGIWMWCV 

PHPKKPEHTLVLLDTEGLGDVKKGDNQNDS 

WnTLAmSSTLVYNSMGTINQQAMDQLYY 

VmXHRIRSKSSPDENENEDSADFVSFFPDFY 

WTLRDFSLDLEADGQPLTPDEYLEYSLKLTQ 

GTSQKDKNFNLPRLCIRKFFPKKKCFVFDLPI 

HRRKLAQLEKLQDEELUPbi" v v Aur K^mi 

FSNSKTKTLSGGIKVNGPRLESLVLTYINAISR 

GDLPCMENAVLALAQIENSAAVQKAIAHYD 

QOMGQKVQLPAETLQELLDLHRVSEREATEV 

YMKNSFKDVDHLFQKKLAAQLDKJCRDDFCK 

ONQEASSDRCSALLQVIFSPLEEEVKAGIYSK 

PGGYCLFIQKLQDLEKKYYEEPRKGIQAEEEL 

OTYLKSKESVTDAILQTDQILTEKEKEIEVEC 

VKAESAQASAKMVEEMQIKYQQMMEEKEKS 

YOEHVKQLTEKMERERAQLLEEQEKTLTSKL 

QEQARVLKERCQGESTQLQwEIQia-QKTUfLK. 

KTKRYMSHKLKl 


1348 


2698 


A 


10404 


5 


892 


"TOLPAPLSGVLSRLQLGSGAPLLTWVQEIAU 
VAGGAPRRRTPVTMWRLLARASAPLLRVPLS 
DSWALLPASAGVKTLLPVPSFEDVSIPEKPKL 
RFIERAPLVPKVRREPia^SDIRGPSTEATEFT 
EGNFAILAUK5GYLHWGHFEMMRLTINRSM 
DPKNMFAIWRVPAPFKPITRXSVGHRMGGGK 
GAIDHYVTPVKAGRLWEMGGRCEFEEVQG 
FLDQVAHKLPFAAKAVSRGTLEKMRKDQEE 
RERNNQNPWTFERIATANMLQIRKVLSPYDL 
THKGKYWGKFYMPKRV 

"?"Doivr7=^CAT7^nT TJnTlTSDMKGSYPVWELIMWlC 


1349 


2699 


A 


10409 


59 


1184 


AGKLQSQLRTTWAAAAFLDAFQKVADMAT 
NTRGGTOEIGSALTRMCMRHRSIEAKLRQFSS 
ALIDCLINPLQEQMEEWKKVANQLDKDHAK 
EYKKARQEIKKKSSDTLKLQKKAKKGRODIQ 
PQLDSALQDVNDKYLIXEETEKQAVRKAUE 
ERGRFCrnSMLRPVIEEEISMLGEnHLQTISE 
DLKSLTMDPHKLPSSSEQVILDLKGSDYSWS 
YOTPPSSPSTTMSRKSSVCSSLNSVNSSDSRSS 
GSHSHSPSSHYRYRSSNLAQQAPVRLSSVSSH 
DSGFISQDAFQSKSPSPMPPEAPNQRRKEKRE 
PDPNGGGPTTASGPPAAAEEAQRPRSM 
■ AGRGGPGKPVSWSSUPOSKKFQRKsWVK^^i 




1350 


2700 




10410 


'TTI 


"l58 " 


RGHSSLLPPSQDFVAGLSVILRGTVDDRLNW 
AFNLYDLl^GCITKEEMLDIMKSIYDMMG 
KYTYPALREEAPREHVESFFQKMDRNKDGV 
VTIEEFIESCQKDENIMRSMQLFDKVI 
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WHAT IS CLAIMED IS: 

1 . An isolated polynucleotide comprising a nucleotide sequence selected fix)m the group 
consisting ofSEQ ID NO: 1-1350. a mature protein coding portion ofSEQ ID NO: 1-1350. an 
active domain of SEQ ID NO: 1-1350, and compiementary sequences thereof. 

2. An isolated polynucleotide encoding a polypeptide with biological activity, wherein said 
polynucleotide hybridizes to the polynucleotide of claim 1 under stringent hybridization 

conditions. 

3 Anisolatedpolynucleotideencodingapolypeptidevrilhbiologicalactivity, whereinsaid 
polynucleotide has greater than about 90% sequence identity with the polynucleotide of claim 1 . 

4. The polynucleotide of claim 1 wherein said polynucleotide is DNA, 

5. An isolated polynucleotide of claim 1 wherein said polymicleotide comprises the 
complementary sequences. 

6. A vector comprising die polynucleotide of claim I . 

7. An expression vector comprising the polynucleotide of claim 1. 

8. A host cell genetically engineered to comprise the polynucleotide of claim 1. 

9. A host cell geneticaUy engineered to comprise the polymicleotide of claim 1 operatively 
associated with a regulatory sequence that modulates expression of the polynucleotide in the host 
cell. . 

10. An isolatedpolypeptide. whereinthepolypeptideis selected from the group consistingof: 

(a) apolypeptideencodedby any one of the polynucleotides of claim 1; and 

(b) apolypeptideencodedbyapolynucleotidehybridizingunderstringentconditions 

with any one of SEQ IDNO:1-1350. 

11. Acomposition comprisingthepolypeptideof claim 10 andacarrier. 

12. An antibody directed against the polypeptide of claim 10. 
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13. A method for detecting the polynudeotide of claim 1 in a sample, comprising: 

a) contacting the sample with a compound that binds to and fonns a complex 

with the polynucleotide of claim 1 for a period sufficient to form the complex; and 

b) detecting the complex, so that if a complex is detected, the polynudeotide 

of claim 1 is detected. 

14. A method for detecting the polynucleotide of claim 1 in a sample, comprising: 

a) contacting the sample under stringent hybridization conditions with 
nucleic acid primers that amieal to the polynucleotide of claim 1 under such conditions; 

b) amplifying a product comprising at least aportioriof the polynucleotide of 

claim I; and 

c) detecting said product and thereby the polynucleotide of claim 1 in the 



1 5 The method of claim 1 4. wherein the polynucleotide is an RNA molecde and the method 
further comprises reverse transcribing an annealed RNA molecule into a cDNA polynucleotide. 

1 6. A method for detecting the polypeptide of claim 10 in a sample, comprismg: 

a) contacting the sample with a compound fliat binds to and forms a complex 

with the polypeptide under conditions and for a period sufficient to form the complex; and 

b) detecting formation of the complex, so that if a complex formation is 
detected, the polypeptide of claim 1 0 is detected. 

17. A method for identic a compound that binds to the polypeptide of claim 1 0. 
comprising: 

a) contacting the compound with tiie polypeptide of claim 10 under 
conditions sufficient to form a polypeptide/compound complex; and 

b) detecting the complex, so that ifthe polypeptide/compound complex is 

detected, a compound that binds to tiie polypeptide of claim 10 is identified. 

18. Ametiiodforidentifyingacompoundtiiatbindstothepolypeptideof claim 10, 
comprising: 
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a) contacting the compound with the polypeptide of claim 10, m a ceU, under 
conditions sufficient to fonn apolypeptide/compound complex, wherein the complex drives 

expression of a reporter gene sequence in tiie cell; and 

b) detecting flic complex by detecting reporter gene sequence e^qjression, so 

that if the polypeptide/compound complex is detected, a compound that binds to the polypeptide 
of claim 10 is identified. 

19. A method of producing the polypeptide of claim 10, comprising, 

a) culturing a host cell conq)rising a polynucleotide sequence selected from 
the group consisting of apolymicleotide sequence of SEQ IDNO: 1-1350, amature protein 
coding portion of SEQ ID NO: 1-1350, an active domain of SEQ ID NO: 1-1350, 
complementary sequences thereof and a polynucleotide sequence hybridizing under stringent 
conditions to SEQ ID NO: 1.-1350. under conditions sufficient to express the polypeptide in said 
cell; and 

b) isolating the polypeptide from the cell culture or cells of step (a). 

20. An isolated polypeptide comprising an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1351-2700. the mature protein portion thereof or the active domain 

thereof. " 

21. The polypeptide of claim 20 wherein the polypeptide is provided on a polypeptide array. 

22. A collection of polynucleotides, wherein the collection comprises the sequence 
iiiformationofatleastoneofSEQIDNO: 1-1350. 

23. nie collection of daim 22, wherein lie coUection is provided on a nucleic acid anay. 

24. The coUection of claim 23. wherein the array detects full-matohes to any one of tiie' 
polynucleotides in the collection. 

25. The collection of claim 23, y/bstda the anay detects mismatches to any one of the 
polynucleotides in the collection. 

26. The coUection of claim 22, wherein the coUection is provided in a computer-readable 
formaL 
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27. A method of treatment comprising administering to a mammalian subject in need thereof 
a therapeutic amount of a composition comprising a polypeptide of claim 10 or 20 and a 
pharmaceutically acceptable carrier. 

28 . A method of treatment comprising administering to a mammalian subject in need thereof 
a therapeutic amount of a composition compridng an antibody 4at specificaUy binds to a 



polypeptide of claun 10 or 20 and a pharmaceutically acceptable 



earner. 
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